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PREFACE 


This book deals with an up-to-date presentation of both 
theoretical and applied aspects of analysis of variance in 
univariate and multivariate cases. Various designs are 
explained with live data from different applied fields. The 
materials presented here are developed from the lecture notes 
of a one semester course at the graduate level on analysis of 
variance which I presented several times at the University of 
Montreal, and in parts at the Indian Statistical Institute, 
Calcutta, Forschungsinstitut der Mathematik, the University of 
Arizona and Dibrugarh University, Assam, India. Each chapter 
contains numerous related problems, complete references and 
worked out examples. The exercises of chapters 6 and 7 on 
multivariate analysis of variance also contain analogous result 
for the complex multivariate normal populations. 

Preparation and revision of the manuscript would not have 
been an easy task without the help of Dr. M.N.Das who helped 
me with great care and diligence. This book was written with 
the direct and indirect help of many people. I owe a great 
debt to my parents and brothers and sisters for their help. My 
wife Nilima, daughter Nabanita and son Nandan have been very 
helpful during the preparation of the book. I gratefully 
acknowledge their assistance. 


Montreal 
March 1985 
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CHAPTER 1. SOME MATHEMATICAL CONCEPTS 


The univariate analysis of variance, as commonly understood and 
practised today in applied sciences, has been developed chiefly 
by Professor R.A. Fisher, who first introduced the terms 
analysis of variance and variance components in statistics. It 
is a statistical technique for analyzing a set of observations 
depending on several kinds of effects, not necessarily 
independent and operating simultaneously, to decide which kinds 
of effects are of importance and to estimate them. Essentially 
it involves decomposing the total variation present in the set 
of observations into several meaningful variance components, 
representing different effects, which generate the observations 
and making suitable comparisons among these components. 

If each observation is a vector quantity depending on 
several characteristics, we get a multivariate analogue of this 
technique, and this is called the multivariate analysis of 
variance. In this chapter we shall present some basic 
mathematical results of matrix algebra and group theory which 
are essential to understand the material presented in this book. 

1.1. MATRIX ALGEBRA 

A real matrix A of dimension p * q is an ordered rectangular 
array of real elements a_, i = l....,p, j = 1, . ,.,q, and is 
written as 



The transpose of A is a q x p matrix A' = (a^J . A matrix A with 
p = q is called a square matrix. A square matrix A is said to be 
symmetric if A = A' and skew symmetric if A = -A'. Obviously the 
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diagonal elements of a skew symmetric matrix are zeros. 


Diagonal matrix :; A square matrix whose off-diagonal 
elements are zeros is called a diagonal matrix. 

A diagonal matrix whose diagonal elements are unity is 
called an identity matrix and is denoted by I. 

Triangular matrix; a square matrix A = (a. .) with 
a ij = °* J K± ia an upper triangular matrix. lf J a . . = o 
for j > i, then A is called a lower triangular matrix^ 

O r thogonal matrix ; A square matrix A is orthogonal if 
AA' = A'A =1. 

No nsingular matrix : A square matrix a whose determinant, 

denoted by | A| , is not equal to zero, is called a nonsingular 

matrix, if | A| - 0, then |a| i s called a singular matrix. The 

rows and columns of a nonsingular matrix are linearly independent, 

The sum of any two nonsingular matrices is not necessarily 
nonsingular. 

Theorem 1.1. Let A and B be two matrices of dimensions p x q 
and q x p respectively. Then | I + ab| 


and 


g x q respectively. 


= 1 1 + b a| 

q are identity matrices of dimensions p * p 


where 
and 


-V A 


■ i q' b 

B-I q 


A<I P 


But 


“V A 

/ 

-v a 






B ' I q 


Similarly 


V 


I p +AB,A 


r= 


B ' X q 


0 I 

q 


IV + M l- 
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-I t B 

q 

A< I_ 


= ll + BA|. 

1 q 


Q.E.D, 


If q = 1, then A is a column vector a, B is a row 
vector b and 


ll + ab'| = 1 + b’a. 

P- 

1.1.1. Minor and Cofactor : For any square matrix A = Ujj)' the 
minor of the element a.j .is the determinant of the matrix formed 
by deleting the ith row and the jth column of A. The cofactor 
A . of the element a^ of the square matrix A is given by 
(-l) 1 x the minor of a.^^ . The determinant of a submatrix of 
dimension i x i of a matrix A whose diagonal elements are also 
the diagonal elements of A is called a principal minor of order 
i. The set of leading principal minors is a set of p principal 
minors of order 1,2, ...,p respectively, such that the matrix of 
the principal minor of order i is a submatrix of the matrix of 
the principal minor of order i + 1. 


1.1.2. Inverse Matrix : The inverse of a nonsingular matrix A of 


dimension p x p is the unique matrix A -1 such that AA 


-1 


A _1 A=I. tet 
Then 


A ii 


denote the cofactor of the elements a . of 

U 


A. 



Theorem l t 2. Let A of dimension p * p and B 
q » q be nonsingular matrices and let u and V 
dimensions p * q and q x p respectively. Then 

(A + UBV) 1 = A 1 - A _1 UB(B + BVA* 1 UB)' 1 BVA“ 1 . 


of dimension 
be matrices of 


( 1 . 2 ) 
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P-C-QQf- . 


(A + UBV) [ A - 1 - A _1 UB(B + BVA -1 UB) -1 BVA _1 ] 

- - 

=1 - UB (B + BVA“ 1 UB)" 1 BVA'" 1 + UBVA -1 - UBVA -1 UB(B + BVA -1 UB) -1 BVA -1 1 
=1 - (UB + UBVA _1 UB) (B + BVA~ ^’UB) “ 1 BVA _ 1 + UBVA -1 
=1 - UBVA -1 + UBVA -1 
= 1 . 

From the uniqueness of the inverse matrix we get the result. 

Q.E.D. 

If q = 1, B = 1 and U = V' # then 

(A + UU') -1 = A -1 ^ A -1 U(1 + U'AU) -1 U'A -1 


= A 


-1 _ A -1 U U'A~ 1 

1 + U 1 A -1 U 


(1-3) 


1.1.3.* Rank and Trace ; A matrix A can be considered as a set of 
row vectors or as a set of column vectors arranged in a particular 
order. The rank of a matrix is defined to be the number of 
linearly independent rows (which is the seme as the number of 
linearly independent columns) that, it contains. 

The trace of a square matrix A = (a..) of dimension p x p 

p i J 

is defined by tr A = £ a. . . 

i=l 11 

Obviously for two matrices A,B, 

tr (A+B) = tr A + tr B, 
tr AB = tr BA. 

For any orthogonal matrix 6, 
tr A = tr A00' = tr 0‘A0. , 

1.1.4. Positive Definite Matrix and Quadratic Fnr n . Let A = (a .) 
be a P x p matrix and x = (x r ..., Xp ) • be a real p-vector. * 

Then i'Ai ^ is called a quadratic form in the real 

variables x..x . jj 

The matrix A or the associated quadratic form x'A* is 
ailed positive definite if *’Ax >0 for all x + 0 and is 



called positive semidefi nite ( Ath EMaTical CONCEPTS 5 

x'Ax 2 : 0 for all x, and x- JV***^** de finite) if 
- f ° r at le ast one x ^ 


1.1.5. Ch^acteristlc_Rootg_^^v e ** ** least one * + 0. 
^dominant role in the devel ^^^^ 2 ^ 1 These will play a 
r hyp° theses * multi variate general linear 

The characteristic roo ts of a s 

by the roots of the characterise ^ trix A are .given 

ls tic equation 


(1.4) 


A-II I = 0 

yhere X is a real constant. Obviously this i, U 

degree p in X and has exactly p roo * s an eqUation of 

matrix, then the diagonal elements J!° !' ” A is a ^agonal 

Since for any orthogonal matrix e characteristic roots. 

|A - -XI | = | 0 A0 ' - XI I, 

f 1.5) 

the characteristic roots of the matriv & 

_ „ e matrix A remain invariant with 

respect to the transformations 

A ■* 0A6 ' . 

\he vector * - (^.x p ) ■. not identically zero, satisfying 

^A-XI)x = 0 (1 . 6) 

is called a characteristic vector of the matrix A corresponding 
to the nonzero characteristic root X. Obviously it follows from 
(1.6) that if x is a characteristic vector of A corresponding 
t° X, then for any scalar C(+ 0), Cx is also a characteristic 
vector corresponding to X. , 

Theorem 1.3. (a) If A is a real symmetric matrix of dimension 
P x P/ then all its characteristic roots are real, (b) The 
characteristic vectors corresponding to distinct roots of A are 
orthogonal. 

Proof . (a) If possible let X be a complex characteristic 
^"oot of a and let x + iwhere x = (x ^.,..., x^) ' e ^= (y^/• • •»Y p ) 
are real p-vectors, be the complex characteristic vector 

corresponding to X. Then from (1.6) 

Y* - ix) 1 A(x + iy;) = X(x - ix) ' (x + ix). 

= Mx'x + z *x) . 


(1..7) 
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Since A is symmetric, 


(x - iy) 'A(x + iy) 

= x'Ax - iy'Ax + ix'Ay + y'Ay 
= x'Ax + y'Ay. 

As x'Ax + y'Ay, x'iS + X'X are real, we conclude that X must be 
real. 


(b) Let X r be two distinct characteristic roots of A 
and let x and y be the characteristic vectors corresponding 
to X^ and Xrespectively. Then 

AX = X^x#Ay = X 2 y. 

Hence 


x 1 Ay = X 2 X'y,y'Ax = X^'x. 

Thus X^y• x = X 2 x'y. Since \ ± X 2 we get y'x = 0 

Q.E.D. 


Let X be a nonzero characteristic root of a positive 
definite matrix A and let x(^= 0) be the corresponding character¬ 
istic vector. 

Then 


'Ax = Xx'x > 0. 


This implies 

that X > 0. 

Thus all characteristic 

roots of a 

positive definite matrix 

• / 

are positive 

• 


Example 1.1. 

Consider the symmetric 

matrix A of 

dimension 

6 

x 6 given by. 





/ 2.279 




\ 


/ -.510 

11.790 



\ 


1 7.715 

13.850 

340.200 




1 .446 

-.235 

2.604 

.184 



\ .310 

.20 2 

3.160 

.149 

.500 / 


\36.730 

-1.340 

164.400 

8.775 

7.333 705.1/ 

The ordered 

characteristi 

c roots of 

A are 

¥ 
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770.290 , 2 7 7.880, 11.120 , 0.469 , 0.303 , 0.0 59. 

Hie characteristic vectors of A corresporifling to the first 
three ordered characteristic roots of A are 

(.0482, .0048, .3576, .0119, .0104, .9325), 

(-.0215, .0504, .9326, -.0026, .0012, -.3568), 

(-.0362, .9977, -.0489, -.0218, .0139, .0156). 

1.1.6. Facfr .a riS.ation of Matrices ; We shall state several 

theorems, mostly without proof, on factorization of, matrices 
which will be useful in the sequel. The reader is referred to 
Girl (19 77) or any standard textbook of matrix theory for proofs. 

Theorem 1..4. For every real symmetric matrix A, there exists 
an orthogonal matrix 0 such that 0A0 1 is a diagonal matrix 
whose diagonal elements are the characteristic roots of A. 

Denoting by X^,...,X the characteristic roots of A, 
we get 

p P 

|A| = n A , tr A = Z A. . 
i=l 1 i=l 1 

Furthermore if A is positive definite, then > 0, i = l,...,p 
and hence |A ] > 0 ,. 

Let A be a positive definite matrix of dimension p x p 

with characteristic roots X-,...,x . Let be the diagonal 

1 P \ 

matrix with diagonal elements X*J#...,Xp and let 0^0=0. Then 
the positive definite matrix A can be factorized as A = CC', 
where C is a nonsingular matrix and the positive definite 
quadratic form X'Ax can be transformed to the diagonal form 

where y; = cx. Furthermore given any positive definite matrix 
A there exists a nonsingular matrix B (= C _1 ) such that BAB'=I. 
Since 

x'A" 1 x = (c _1 x) • (C _1 x) , 

we conclude that if A is positive definite ;a~* is also 
positive definite. 

Theor^>m 1 1 ^ L e t A be a symmetric positive definite matrix of 
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h a a x P matrix. Then BAB' 

V n and let B be a q P 

dimension p*P * ltive se mddefinite of the same rank 

is symmetric and at le P 


as 6 • 


Mt A be symmetric and at least positive 

and of rank r £ p. Th© n 


JSSiS of”dimenSion p * P and of rank r s p. Then A ha, 
TJly r positive characteristic roots and the remains p-r 
characteristic roots of A are zeros. 

Then there exists 


Theorem 1.7 . Let A 
a nonsingular matrix 


be a nonsingular matrix. 
C such that 


CAC 



where the order of I equals the nuntoer of positive character is ti 
roots of A and that of -I equals the number of negative 
characteristic roots of A. 


Theorem 1.8. Let 


and B be two matrices of dimensions p x c 


and q x p respectively. Then every nonzero characteristic root 
of AB is also a nonzero characteristic root of BA. The proof 
follows trivially from the fact that 


AI, A 


AI ,B 

B ,1 


A ,1 


Theorem l t | 9 . If A is positive definite and B is positive 

semidef inite of the same dimension p x p, then there exists a 

nonsingular matrix c such that CAC =1, C BC = D, where D 

xs a diagonal matrix whose diagonal elements are the roots of 
| B - A A | = 0 . 


.r 

Let A _ be ' a "atrix of. dimension p x q , p < 

A is less tha^T^a and .®° SitiVe sem idefinite if the rank 


a is less tha^T^a and .r >SitiVe Semidefiriite if the rai 

^p and positive definite if 

equal to p. ™ if the rank of a 

•“ «, 

“ «...»«.“ Z'Z.? “• 

y ai P x p matrix e 


SUrh 
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that 6AA«6■ is a diagonal matrix with nonzero diagonal 

elements A,..., A . Writing x = (x,.x ) *, y = (y.,...,y)' 

1 p l p 

= 0x, we get 

r ? 

x'AA'x = £ A.y. > 0 

" ” i=l 11 ' 

for all x. If r = p, then 


x'AA'x 


P 

l 

i=l 


A .y . 2 > 

i J l 


0 for all x | 0. 


Q.E.D. 


Theorem 1.11 . Let A and B be two r x p matrices of the 
same rank = min (r,p) . Then B'B = A'A if and only if there 
exists an orthogonal matrix 0 such that B = 0A. 


Proof. If B - 0A, then B'B = A'0'0A = A'A. To prove the 
converse, consider first the case r = p. Since the rank of A is 
p, the columns of A are linearly independent. Now B'B = A'A 
implies that 0 = BA ^ is an orthogonal matrix and that B = 9A. 
Consider now the case r > p. Since for any two p-dimensional 
subspaces of the r-space there exists an orthogonal transforma¬ 
tion transforming one to the other, we can assume that after a 
suitable orthogonal transformation, the p column vectors of A, 
B will lie in the same subspace and the problem is reduced to 
the case r = p. 


If r < p, we can without any loss of generality assume that 
the first r column vectors of A are linearly independent." 
Write A = (A 1# A 2 ), where and A 2 are submatrices of 

dimension r x r and r x(p- r ) respectively, and similarly 
B = (b 1 ,b 2 ) . Since B'B = A'A, we get 

B i B i = A i A i' B x B 2 = A 1 A 2 # B 2 B 2 = A 2 A 2* (1.8) 

By the previous argument B{B 1 = AjA x implies that there exists an 

orthogonal matrix 0= (B|) A| such that B 1 = 0A^. Also 

B 1 B 2 = A i A o implies that B = 0A.,. Obviously B'B n = A'A« with 
r-aa * " 22 22 

B 2 ~ 0A 2‘ 

Q.E.D. 
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*■•1*7. Idempo tent Matrix : A square matrix A is called 
idempotent if AA = A* 

theor em 1.12 .; Let A be an idempotent matrix of dimension 
P x p. Then, 

(a) if a is nonsingular, A = I, 

(b) if a is symmetric and if a^ = 0, all elements of the 
ith row and the ith column of A are zeros, 

(c) the characteristic roots of A are either unity or zero, 

(d) rank (A) = tr A, 

(e) if for any matrix B, A + B = I, B is idempotent and 
AB = BA = 0. 

Proof . 

(a) AA = A and A is nonsingular implies that A = A -1 AA 
= A _1 A = I. 


(b) AA = A (A = (a. .)) implies that 

lj 


P 

a . . = I a . .a . . 
11 j = l U J 1 


Thus if a. . =. 0 then a. . = 0, j = 1, 
ii ij 

symmetric, so a^ = 0, j = l,...,p. 


.,p. But A is 


(c) Let X be any characteristic root of A and let x f 0 
be the corresponding characteristic vector of A. Then 

2 

Ax = Xx, AAx = XAx = X X- 

So X z x = Xx. Since x ^ 0, we conclude that X = X, or X = 1 or 0 . 

(d) It follows trivially from (e) . 


(e) It is trivial. 


Q . E • D • 


1.1.3. Generalized Inverse (g-inverse) : If the matrix A is 
not a square matrix or if A is a singular square matrix then, 
obviously. A -1 does not exist. However it is possible to 
define a generalization of the usual inverse A“* by A - in 
the following way: 
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nof-jnition l.l. (a-inverse) ; Let A be a P x q matrix of 
nonzero rank. A g-inverse of A is a q * P matrix, denoted 
by A - , such that for any p-vector y and q-vector x # Ax = X. 
is a consistent equation and x = A~% is a solution. 

A” may exist even when A -1 does not and it equals A 
whenever A” 1 exists. It is not necessarily unique, although it 
is possible, by imposing additional conditions on A“, to define 

it uniquely. 

rfhoorpm 1.13 . Let A be a p x q matrix. Then A - exists if 
and only if AA A = A. 

Proof . Choose as the ith column vector a i of A. 

Obviously Ax = a^ is consistent and x = A - ^ is a solution. 

Thus 

AA - a i = Ax = a^ 

for all i. This implies that A = AA A. Conversely if AA A 
= A and Ax = ^ is consistent, then 

AA“Ax = Ax = y , or AA _ y = X- 
Hence x = A“y is a solution. 

Q.E.D. 

Let H = A - A. Then by the above theorem 
HH = A~AA~A = A”A = H. (1.9) 

Thus H is an idempotent matrix. Furthermore 

rank H = rank A. (1.10) 

In a similar fashion we can verify the following theorem. 

Theorem 1.14 . (a) A general solution of Ax = 0. is (H - I) Z 

where Z is arbitrary. (b) A general solution of the consistent 
equation Ax = y is A”y + (H - I) Z where Z is arbitrary. 

Theorem 1.15 . Let A" be the g-inverse of A. Then rank 
A = rank A - . 

Proof . Since the rank of A is not zero, there exist 
nonsingular matrices B, C such that BAC is a diagonal matrix 
D (not necessarily a square matrix) . Denote by D” the 
diagonal matrix obtained from D by replacing.^ its nonzero 
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diagonal elements by their reciprocals. Obviously, 

DD“D = D, D"DD~ = D. Consider A - CD B. 

Then i i i _1 

AA'A = B-W 1 CD-BB^DC' 1 = B V = A. 

By Theorem 1.13, A" is a generalized inverse of A. 
Obviously, rank (A) ^ rank A . Now 

A"AA“ = CD~BB" 1 DC" 1 CD"B 


=' CD DD B 


-1 


= CD"B = A . 

Thus (A")“ = A and rank A" > rank A. Hence the theorem. 

Q t E .D . 

1,1.9. Linear Equations : Consider the following system of p 
linear equations ih q unknowns x^, ...,x^ (reals) 

a ll X i + ••• + a lq X q =b l 


a pl X l + + ^ 


( 1 . 11 ) 


where a. ana b^ are known real constants. In matrix 
notation (1.11) can be written as 

Ax = b , 

where A = (a. .) is a matrix of dimension p x q and x = 

(x x . x)\b = (bi.b p )'. If b = 0, (1.12) is called a 

"homogeneous system" of linear equations. Otherwise, it is 
called a "nonhomogeneous system". 

A homogeneous system of linear equations Ax = 0 always has 
a solution, x = £ being a trivial solution. This system has a 
nontrivial solution x / 0 if and only if the q column vectors 
of A are linearly dependent. This follows from the fact that 
Ax = 0 can be written as 


x^ 1 + 


.. + x & q = o 

q— — 


( 1 . 12 ) 


^ '■a 

where denotes the jth column vector of A. 

Theorem 1.16 . The solutions of a homogeneous system of linear 
equations Ax = £ form a vector space. 
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In other words, if a r 

a +6 and ca , where c ~ tWO elutions of Ax = 0. then 

solutions of Ax = 0. ® “ constant ' are als ° 

Let X denote the veri-nr- ,, 

and let R(A) denote the vector^ ^ ^ S ° 1Uti0nS of ^ 

space spanned by tne rows of A. 

T heorem 1.17 . x is 

-- ^ gonal to R(A) and rank of X + rank 

New we consider tv*a 

. nhomogeneous system of linear 

equations Ax = b (b / o) . We f . 

. h . . “ irst observe that this system may 

not have any solution. Write Ax = b as 

1 i q (1.13) 

where 8 is the ith column vector of A. From (1.13) it is 

r rr ^ —^on i£ and 0 m y if b is 

linearly dependent on ft 1 i -i . ~ 

- ' A • • */q. This implies that the 

rank of A is the same as the rank of ( A ,b! , which is the 
matrix A augmented by the addition of the column b. 

_ ~ *“i' ■••• a q ) , 1 = (B 1 , ...,8 )' be any two solutions 

o x obviously a-B is a solution of Ax = 0^. on the 

other hand, if o is any particular solution of Ax = b and 6 

is any solution of Ax = 0 th#*n n + a < , ^ . 

- - tnen 2 + B is a solution of Ax = b 

Thus we get the following theorem. ~ 

SieorenuUlB. All solutions of Ax = b (if there be any) are 

obtained by adding to a particular solution of Ax = b all 
solutions of Ax = 0_. 

For finding a particular solution of Ax = b consider the 
associated homogeneous system 


Ax - bz = 0 , 

" (1.14 

obtained by introducing an additional dummy variable z. it can 
be easily verified that if the nonhomogeneous system Ax = b 
admits Of a solution then the homogeneous system Ax - bz = 0 
admits of a solution (x^, ... - t x^,z) with z *0. Such a solution 
can be found by sweeping out the matrix B = (A,b) . For details 
the reader is referred to Giri (1974). Write 
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x. 

V , = ““ , i = 1# • • • » Q • 

* i / z 

Clearly, ^(y., ...,y J ' is a particular solution of Ax = b. 

■*“ 'd 

If p = q and A is a nonsingular matrix of dimension 
p x p, then the nonhomogeneous system Ax = b has a unique 
solution x = A _1 b. For the case where A is a singular 
matrix or the general case where A is not a square matrix we 
can express a particular solution of Ax = b in terms of a 
"generalized inverse" of the matrix A. 

The concept of the generalized inverse of a matrix A is 
connected with that of the echelon form (sweep-out form) and 
the Hermite canonical form of the matrix A. 

Echelon form . A matrix A is said to be in an echelon form 
(or the sweep-out form) if and only if 

1) each row of A consists of zeros only or a unity preceded 

by zeros and not necessarily succeeded by zeros. 

2) any column of A containing the first unity of a row has 

the rest of its elements zeros. 

Hermite canonical form . A matrix A is said to be in a 
Hermite canonical form if its principal diagonal consists of only 
zeros and unities such that when a diagonal element is zero all 
the elements in the corresponding row are zeros, and when a 
diagonal element is unity all the remaining elements in the 
corresponding column are zeros. 

It is now obvious that a matrix in a Hermite canonical form 
is obtained from a matrix in an echelon form by an interchange 
of rows. 

Theorem 1.19 . Let A be a matrix of dimension p x q. Then 
there exists a nonsingular matrix D of dimension p * p or 
a nonsingular matrix E of dimension q * q such that DA = H 
or AE = h, where H is a matrix in the Hermite canonical form* 
If, in particular, A is a square matrix of dimension p * P» 
then H is a square matrix of the same dimension and HH = H. 

In other words H is an idempotent matrix. 
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Let tne rank of A be r. By Theorem 1.19 there exists a 
nonsingular matrix D of dimension p -v p such that 


DA = H = 

0 0 

where I is the unit matrix of dimension r x r and C is 
of dimension r x (q-r). 

Let the matrix d be partitioned as 
D, 


D = 


(1) 


( 2 ) 


( 2 ) 


where D (1) is made up of the first r rows of D and D 
is made up of the remaining (p-r) rows of D. The 
nonhomogeneous system Ax = b admits of a solution if D (2) b 
and the Herpiite canonical form of the matrix (A,-b) of the 
homogeneous system (1.14) is 


• D (l)b 


G = 


0 0 0 
The solutions of (1.14) are given by (see Giri, 1974) 


-(1) _C -(2) + (D (i)'b)z (1.15) 

where X(i) = ( x ^/•../ x r ) ' # x ( 2 ) = ^ x r +i' • • •' x q ) ' • For x 
and z = 1, a particular solution of the nonhomogeneous system 
Ax = b can be written as 


(2) " £ 


2 = 


D (l)- 


0 


= D*b 


where Di* is a matrix of dimension q x p such that Dt = d 

and °| 2 )b = .2. and 1 (1> 


D* = 


D* 
°( 1 ) 


°(2) 


Central Library 
B11371 


519.5; 


8 GIR(ANA) 


St. Xavier's College 
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Then D* is a generalized inverse of A. 
Example 1.2. Let 


A = 



-2 

5 


-3 



Since 




last row = -Uirst row + second row), 

A is a singular matrix and hence A 1 does not exist. Consider 
the following system of no'nhomogeneons equations in x^x^x.^ 



-2 

5 

-3 




Sweeping out the first and the second columns of A, we get. 



0 

1 

0 





Thus the equations are solvable if and only if Y l + Y 2 + *3 = 

for which the nonhomogeneous equations are 


For any y, / Y 2 ’ y 3 „ . 

solvable we get a particular solution by taking - 0, an 


this solution is 



x 3 = °> 


Thus the matrix 
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j.s a generalized inverse of A. As Yi + Y 2 + ^3 = ^ we coU '^ wr; *- te 
the particular solution as ^ 



so that the matrix on the right is another generalized inverse of 
A. Alternatively, we could put x 3 =y 3 to get a different 
particular solution as 



so that the matrix on the right gives another generalized inverse 
of A. 

Q.E.D. 

,1.1.10. Partitioned Matrices ; Let a square matrix A be 
partitioned into 
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= I A 


22 


A 11 " A 12 A 22 A 21 


Theorem 1.21 . A symmetric matrix 


A 


11 


12 


A = 


‘21 


22 


of dimension p x p (A^ is of dimension q x q) is positive 
definite if and only if 
definite. 


A ll' A 22 - A 21 A il A 12 are pOSitiVe 


£roof. Let x* = (xj-jj/ £( 2 )^' where -(1) = < x i' • • •' x q > 
is a p^-vector 


x'Ax = (x 


(1) + A 11 A 12^(2) ),A 11 ( -(1) + A 11 A 12 2 (2) ) 


+ (2) (A 22 ~ A 21 A 11 A 12 ) —( 2) * 


(1.16) 


If 


A is positive definite, then obviously A.^, a 22 

A ll' A 22 “ A 21 A U A 12 


positive definite. From (1.11) if 
positive definite, then A is positive definite 


are 

are 
Conversely. 


if A and consequently A.. 1 are positive definite, then by 
taking x^ 0.) such that X(u + A ii A i2^( 2) = —' we conclude 
that A 22 - A 21 A“^A 12 is positive definite. Q.E.d. 


% 

A 


Theorem 1.22 . Let a positive definite matrix A be partitioned 


into submatrices A_. , i,j = 1,2 where A^^ is a square 


submatrix and let A 


«1 




submatrices B^j, i, J = 1/ 


= B be similarly partitioned into 
Then 


A 11 B 11 B 12 B 2 2 B 21' 


A 22 B 22 B 21 B 11 B 12 / 


.-1 


A 11 A 12 “ B 12 B 2 2 * 


The proof is obvious. 


1.1.11. 

let 


Some Results on Derivative: 


Let x = (x , .. ,,x) ' 
l P 


and 


JL - ( 9 

3x v *dx/** ,# 

T 1 


3x_ 


For any scalar function f(x) of the vector x 
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ax 


3f 

3X-, 


3f 


= < ■^r:. 3x, 


Let A = ( a jj) a matr i x of dimension p x p and let f (x) 


«V 


= X 1 AX- 

* 4 Since 

2 

i ix — a. .x. t 2x. Z a. .x. + Z a xx 

X'AX i! 1 1 1J j ^ a iaW 




we get 

3f (x) P 

= 2 Z a. .x. 


ax 

Hence 

3f (x) 
~3 jT 


j=l 


= 2Ax. 


1J J 


Similarly one can verify that 


il£i = A. . , ^ l A L = A. . 

11 3a ij 1J 


/ 


\ where A.j is the cofactor of the element 


* J 

symmetric then 


iJ 


of 


If 


IS 


iki = 

3a. . 

li 


A ii' 


ilhl - 

8a ij 


= 2A 


iJ 


1.1.12. Complex Matrices : Here we shall discuss briefly complex 
matrices - matrices with complex elements and state some results 
without proof. These results will be useful to study th-. complex 
analogue of analysis of variance. The reader is referred to 
MacDuffee (1948) for proofs and related results. The adjoint A* 
of a complex matrix A = (a....) of dimension p * q is a q * p 
matrix A* = (a. .) ' where " - " denotes the complex conjugate 
of a . . and " 1 " denotes the transpose. For any two complex 
matrices A,B for which AB is defined, (AB)* = B*A*. A 
complex square matrix A is called unitary if AA* - I,, the real 
) identity matrix, and it is called hermitian if A = A*. A square 
lj% complex matrix A is called normal if AA* = A*A. A hermitian 
matrix A of dimension p x p is called positive definite 
(semipositive definite) if for all complex p-vectors, (nonnull) 
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c ?*AI > 0 (i 0). Since for any hermitian matrix A, <_£**!>* 

='i*Al, the hermitian quadratic fdrm assumes only real values. . 

Th^etn 1.23 . Let A be a hermitian matrix o£ dimension p * P- 

Then 

(a) all roots of the characteristic equation |A - XI | 0 are 

real, 

(b) if A is positive definite all characteristic roots of A 
are positive, 

(c) there exists an unitary matrix U such that U*AU is a 
diagonal matrix whose diagonal elements are the 
characteristic roots of A. 

Theorem 1.24 . Let A be a hermitian(positive definite matrix. 

Then 

\ 

(a) A can be uniquely factorized as A = BB* where B is a 
hermitian positive definite matrix, 

(b) there exists a complex nonsingular matrix B such that 
BAB* = I. 


1.1.13. Jacobian of Some Transformations : We merely state some 
useful results. For proofs the reader is referred to Giri 
(19 76) . 


Theorem 1.25 . Let V be a vector space of dimension p. For 
x, y e V, the jacobian of the linear transformation = 

(y^, ...,y ) ' = Ax where A is a nonsingular matrix of dimension 
p x P, is 



3y l 

9x 

3x '*‘’'3X 


Theorem 1.26 . Let the p x N matrix X be transformed to the 

p x N matrix Y = AX where A is a p x p nonsingular matrix,^ 

then the jacobian of the transformation X -► Y is given by 

il = IAI N 

8X • 

a£orsm_U27. Let the p x N matrix X be transformed to the 

P x N matrix Y = A x B, where A and B are nonsingular 

of dimensions p x p and N x u respectively. Then 




SOME MATHEMATICAL CONCEPTS 21 


the jacobian of the transformation Y -*■ x, is given by 


= A 


l N Ib l p 


'Theorem 1.28 . Let A be a symmetric positive definite matrix 
of dimension p x p. The jacobian of the transformation CSC'-hS, 

where C is a nonsingular matrix of dimension p x p is 
lo|P + 1 


* 


Note that if j is the jacobian of the transformation 
Y X, then the jacobian of the inverse transformation 
X -*■ Y is J -1 . 


1.2. GROUPS 

The Univariate and the multivariate analysis of variance models 
exhibit symmetries with respect to certain groups of transforma¬ 
tions. Solutions to these problems can be obtained in a 
straightforward way by direct application of group theoretic 
results. This section will deal with, a short review of relevant 
results of group theory. 

A group G is a set of elements associated with an 
operation t satisfying the following axiotris: 

t 

A 1 : For any two elements g 2 in G, g.xg 2 belongs to 

G. 

A 2 : For any three elements g^,g 2 ,g 3 in G 
g 1 x(g 2 xg 3 ) = (g^g^xg^ 

A 3 : (Identity element): There exists an element 0 in G 
such that for all g in G, gx0 belongs to G. 

A.: (Inverse element): For every element g in G there 

■ H _i _i 

exists an element g in G such that gxg = 0. 

The following properties of the group G are easy to 
establish. 

For g e G 

-1 -1 

1. gxg = g xg = 0*. 

\ 

2. gx0 = &xg = g. 
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3. For- g, h belonging to G, equation gxx = h in 
x has a unique solution x = g _1 Th. 


t and write 


For the sake of convenience we will omit 
g l g 2 for g l Tg 2* 

P ro d rt e g"°g t 2 " 10n Sh ° Uld “ With the -^ hm etical 

AtoU Sn group , A group G is called abeUan , f 
paxr of elements g r q 2 belonging to fl# ^ belongs ^ 

formslrSi^’ ThS Set ° f 311 nUmberS R Under addition 

with g - 1 . . g 9r ° UP Wlth Zer ° aS thS identit y element and 

all 071 “^ qr ° ,!C ' ^ X bS 3 Set and let G be the set of 
for X y b °°r “ aPPin9S 9 ! X ° nt ° X SUCh thab (i) if gx = gy 

for X, y belonging to x then x = y, and , - 97 

there exists a g e g sati«?f v - f ° r x ' y e X 

defined by, g.g x - g ( g x) 7 ? ~ 9X ' Wlth the group °I«ratioi 

group, and il^Ld th ! 2 9 *' 9 2 £ G < * * X . 0 forn* a 

d the permutation group. 

. l " x 

' ir x and Y are in x so is aa _ . .. 

talers a.b, . Fo r every x , * Line g x * ^ 307 

x e x. The set of all fornB an ^ ' V 

and is called the translation group. "" ^ 

“ i:: ■ •—»« 

““ ««. .s ’ r; :m * u 0 

group. The group operation is the natrix multiL 7 ^ * 
identity element being the identity matrix. ' the 

An elStnfS'af^ne X ^ 3 SPaCS ° f dimension "• 

g C G j( n) ,x e X t T G = (G * (n),X3 iS ^ paip ‘ g ' x >' 

1 X# The 9 r °up operation for th* 

is given by, t * le group G 

(g r X 1 )(g 2 , X2 ) _ (gi g 2(gX2 + Xi> 

identity elewent •* «*• 




- - —j or one 

is (1,0*) and the inverse element is given by 
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(g,x) 


-1 


, -1 -1 . 

(g / -g x) . 


a 

r 


1.2.1. Subgroup : Let G be a group and the group operation in 
G be t. If the "restriction of t to a nonempty subset H of 


G also satisfies all the group axioms A 1 -A 4 , 

then 

H 

is called 

a subgroup of G. 




Theorem 1.29. Let G be a group and Her:. 

Then 

H 

is a 


subgroup' of G if and only if (i) H is nonempty (ii) if 
g l' g 2 G H then g i g 2~ 1 e H - 

Proof . By (i) and (ii) if g , g 2 are in H so are' 
g^ 1 = 0 and g” 1 = eg" 1 . Thus gj^tg” 1 )" 1 = , g^ is aiso in 
H. The axiom A 3 is true in H as it is true in G. Hence H 
is a group. Conversely if H is a group, then clearly H 
satisfies (i) and (ii) above. 

Q.E.D. 


1.2.2. Normal Subgroup: A subgroup H of G is a normal 
subgroup if for all h e H and g e G 

ghg -1 e H, 

or equivalently gHg" 1 = H. 

Example 1.4 . Let x be a linear space of dimension n and 
let I be the identity matrix of dimension n x n. For 
g e G^(n) , x e x , 

(g,x) (I,x) (g,x) _1 = (g, x) (I,x) (g" 1 ,-g’ 1 x) 

= (g/X) (g" 1 ,-g“ 1 x + x) 

= (I,gx) . 

Since (I,gx) is an element of (I,x), we conclude that (I, x ) 
is a normal subgroup of (G^(n),x) . 

*•2.3. Quotient Group; Let G be a group and let H be a 
normal subgroup of G. Define g|h as the set of all elements 
0* = {gh|h e H}, g e G. Noting that for g 1# g 2 e G, (g^H) (g H) 
is the set of all elements obtained by multiplying, with 
respect to the group operation, all elements of g h with all 
elements of gji. we see that g|H forms a group; it is called 
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the quotient group of G(mod H) . 

The fact that g|h satisfies the axioms A^-A^ can be 
verified from the following: 

(i) g x H = g 2 H 

<=> ?2 1s 1 H = H 

<=> 9 2^1 « H 
<=> 9 X e g 2 H » 

(ii) Since H is a normal subgroup, g^Hg” 1 = H or = Hg 2 

( gi H) (g 2 H) = gi Hg 2 H = g^H. 

Hence g^H is in g|H. 

Ciii) H is the identity element in g|h. 

(iv) Since 

gHg = gg ^HH = gg -i H = H 
(gH) _1 = g _1 H. 

1.2.4. Homomorphism : Let G, H be two groups. Then a mapping 
of G into H is called a homomorphism if it preserves the 


group operations, that is, for g.^ and g in G, 


f(g x g 2 ) -f (g x ) f (g 2 ). 


The product 9^2 is defined with respect to the group operation 
in G and f(g^)f(g 2 ) is defined with respect to the group 
operation in H. 


Theorem 1.30 . (a) If 9 is the identity element of. G, f(0) is 

the identity element of H. 

(b) For g e G, f(g -1 ) = [f(g)] _1 - 

I 

Proof . 

(a) f (g) = f (g9) = f(g)f(6) 

(b) f(0) = f(gg _1 ) = £(g)f(g" 1 ). 

Q.E.D. 

1.2.5. Direct Product : Let G, H be groups and let G x H be 
the cartesian product of . G and H. With the operation <for 
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01' g 2' 9 i 9 2 are in G and h-^h^h^ are in H), 

(g^'h^) ^2'^2^ = ^^ 3 .^ 2 '^ 1^** 

G x H . forms a group and is called the direct product of G and H. 

$ 1.2.6. Transformati on Group; Let x be a set and let G be a 
| group. G operates from the left on x if there exists a 

function on G x x into x whose value at (g, x) , g e G, x e x 

is given by gx and if it satisfies 

,(i) 0x - x for all x e x / where 0 is the identity element of G. 

(ii) <3 1 Q 2 x = 9 x (g 2 x) . 

^ or ever V pair (x 1 , x 2 ) e x there exists a g e G such that 
5 x i = x 2 then we say G operates transitively on x from the 
left. It follows from (i) and (ii) that if for g. e G and x^,x 2 
e X». ' 

gx a = gx 2 = y, 

then 

g" 1 gx 1 = g _1 gx 2 = g _1 y, 
and x^ = x 2 = g -1 y. 

Example 1.5 . Let x he a linear space of dimension n. Then 
(n) operates transitively on x - {0}. 

* 5 # 

EXERCISE 1 

1. Show that for any two matrices A, B 
(A + B) ( A - B) = AA - BB 

if and only if AB=BA. 

2. Sriow that for any three matrices A,B,C, rank (AB) -h rank(BC )s 
rank(A) + rank(ABC). 

3. Show that for any matrix A, AA is symmetric if A is skew. 

4. A skew matrix with an odd number of rows is singular. 

5. / The determinant of a skew-symmetric matrix is nonnegative. 
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6. For any nonsingular matrix A, show that all its principal 
minors are nonsingular. 

7. Let A be a nonsingular matrix. Show that 

(i) if A is symmetric and positive definite, then there exists 
a unique lower triangular matrix T with positive diagonal 
elements such that A = TT'. 

(ii) There exists a lower triangular nonsingular matrix C such 
that A = CC'. 

8. If A is a square matrix, then there exists an orthogonal 
matrix 0 such that OA is a lower triangular matrix. 

9. Let A, B be two positive definite matrices of the same 
dimension. 

Show that 

|6A + (1-6)B| > | A | 6 | B | 3 - 6 , 0 £ 0 < 1. 

10. Let A^, ...,A^ be symmetric matrices of the same dimension. 
If A^Aj = 0 for i / j and if E A i is an idempotent 

matrix, then show that k. for each i is an idempotent matrix 
k k 

and rank ( E A.) = E rank (A.). 

i=l 1 i= i 1 

11. Let A be a symmetric positive definite matrix and let B 
be a symmetric semipositive ft definite matrix of the same dimension 
p*p and rank r < p. Show that 

(i) all roots of the characteristic equation |B - \A| = o are 
zeros if and only if B = 0, 

(ii) all roots of |b.- Aa| = 0 are unity if and only if B = A. 


12. Let 


E = 


1 P 
P 1 


• 0 , 0 < p < 1, 


Show that the largest characteristic root of E is 

x i = ° 2 u + (p-Dp), 
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and the corresponding normalized characteristic vector of I is 
(l//p,..., l//p). \ 

13. Show that for any lower triangular matrix A the diagonal 
elements are its characteristic roots• 

14. Show that an orthogonal transformation can be treated as a 
change of axes about a fixed origin. 

15. Let A, B be two matrices of the same dimension p x q. 
Then show that 

(a ) 9 ( tr A'B) = 3(tr AB') 

9B 8B 

where 

3B = ( 3b. } ' B “ tb ij J ' 

ij 

(b) if A is a square symmetric matrix, then 

aitr AA) _ 

_3 A “ 

16. Show that the set of all orthogonal matrices of the same 
dimension forms a group. 

17. Show that the set of all p x p nonsingular triangular 
matrices forms a group. 

18. Show that the intersection of two subgroups H and K of 
group G is also a subgroup of g. 

19. Show that every subgroup of an abelian group is a normal 
subgroup. 
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CHAPTER 2. BASIC STATISTICAL CONCEPTS 

L 


f Statistical inference in l inear 

f techniques depends on the partition e 1131,19 3nal T si = °£ variance 
several independent components each a “ ° hiS5uare variable into 
variable and on f and student-t st 1Stributed as a chisquare 

( shall first discuss the nonnull Sbatlst:LCS ’ ^ thiS chapter we 

The multivariate analogues of th 1S ribUtl ° ns of these statistics, 
considered to facilitate dit,-, GSe distribution s will also be 

variance. Most analysis of varUnL^rrcrlt^rarr 17318 • 
procedures with respect to certain nr invariant 

] leaving the hypothesis in qu^on transformations 

^ in question invariant. Thus to obtain the 

appropriate test criteria and to studv ■ 

a. ^ heir optimum properties we 

need to outline in some detai l +->.„ 

some detail the principle of invariance in 

statistical inference. This win =ie,« w 

is Wl11 also be done in this chapter. 

2.1. NONCENTRAL CHISQUARE DISTRIBUTION 
'' ^\ Let X l /# * wX n be independently distributed normal random 
iJvariables with E(x.) = y Var(x,) = a 2 and let u = I x 2 /a 2 

T* * 1 ii i_i i i* 

Sheorem 2.1 . The random variable U has a noncentral chisquare 

distribution X_T(A z ) with N degrees of freedom and the 

N N 2 , 2 

noncentrality parameter A = £ V./o., Its probability density 

i= 1 i i 

function is given by 


f U<U» 


exp {- ^(A 2 +u) } 1 00 (A 2 u) r(j+i$) 

-,- u* W E --:-—, if U > 0, 

y^2^ N j=o (2j) I.r(JsN+j) 


( 2 . 1 ) 


otherwise. 



Proof . Let » 4 - X^, i = 1 .Obviously Y ± 's are 

■ndependently distributed normal random variables with 

itY,) = = v . ( say) 

i 

# v Y J ■* and let 

bcl unit variance. Let Y = (Yn 
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z. = (Z r ...,Z N ) ' = OY 

where 0 is an orthogonal matrix of the following form 


O = 


( 


i In 

A ' * * * ' X 


o 


2i/... , 

• / • • • / 


0 


N1 


Then 


E(Z j .) = i 


X 

0 


0 


2N 


/ • • • g 


0 


NN 


if i = 1, 
if i > 1, 


N 


Cov 


= z 

o. . o., . 

j=l 

ij i • j 

N 

= Z 

0. . 0. , . 

j=l 

1J 1 ’ J 

r 

•l 

if i 


Var(Y .) + 
J 


N N 


* Z °ii °i ' i 1 Cov (Yj / Y . ,} 
j = l j ' = 1 1 J J J 


= < 


0 otherwise 


Thus Z x .Z n are independently distributed normal random 

variables with unit variance and with Efz^ =X, E(Z.) =0, i > 

2 N 2 

Let X = Z^, Y - Z Z . Then X, Y are independent. The 

i = 2 

probability density functions f x (x), f y (y) of x and Y 
respectively are given by, 


f (x) = — 
x 2/2 tt 


-h 


- X " exp (- ^(/x-X) 2 } + —-— x’^exp {-^(/x+X) 2 } 

2/2tT / 

00 


= ^— exp {-Mx+X 2 )} x 15-1 Z 


/2v 


j=o 


( 2 j) i ' 


( 2 . 2 ) 


f v (y> = 


F (4s(N-l) ) 2 I " 2(N " 1) 


exp {- ^y} yj? ^N- 1) - 1, 


whenever x > 0, y > 0. From (2.2), the probability density 
function of u = X+Y is 
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c _ exp{-^i ( A 2 +u) } U ^ N “ 1 y (X 2 )^ vr* r(j+k) 

U j= 0 (2j) ir (HH+j) 

y } .LM+jJ_ V *s + J-I a v) *5(n-1)-i d 

r o r (j +4) r (Sj(n-i)) v ll - v) 


- exp{-k(A +u) } kN-1 
/» 2 *" 


where u s 0, v = x/u. 


Remark; 


(x 2 ) j u j r(i+k) 


jlo (2j) ir^N+j.) 


Q.E.D 


1. Since for any positive integer k 

r(2k+i) = 2 K r(k+i) r(K+J $ ) / 

we can rewrite f^u) as 


V U) = J. 

k=o 


i ^ i ( j2v k 


exp{-kA >(^A 
k I 


exp{-k u} u 


kN+k-1 


r(-^N+k) ,2 2 


kN+k 


(2.3) 


= E p (k) f 

k= ° X N+2k 


(u) , 


where p K (k) is the probability mass function of the Poisson 


random 


variable K with parameter and f ^ is the 


N+2k 

probability density function of the central chisquare random 
variable with N+2k degrees of freedom. 

2. The characteristic function of U is given by 


^(t) = E(e ltU ) 

= ( l-2it) exp{ 2^2itf”^ 


(2.4) 


for real t and i = /TI. From the unicity property of the 
characteristic function it follows that if U^, are. 

independently distributed noncentral chisquare random variables 

‘ X N 2 (A 2 ), i = 1, then k U. is distributed as a noncentral 

L 1 k i=l 1 

wchisquare with I N. degrees of freedom and with noncentrality 

t k i=l 1 

parameter £ A . 

i=l i 
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3. From (2.4) it follows that 


E(x 2 (A 2 )) = N+X 2 , Var(x^(A 2 )) = 2 N+ 4 X 2 . 


(2.5) 


2.2. NONCENTRAL F, NONCENTRAL BETA AND NONCENTRAL t-DISTRIBUTIONS 
Let the random variable X, distributed as x m ( x ) and the randoni 

r\ 

variable Y, distributed as be independent and let F=nX/mY. 

Theorem 2.2 . The probability density function of F is given by 


f F (u) = 


U exp{-HX 2 } i 


(HX 2 ) j r (^(m+n) +j) ( (m/n)u}^ m+j ~~ L _ 


m v ^(m+n)+j 


l 


otherwise 


LJ . 

j=o r (j+D r (i$m+j) r (^n) (i+ -u) 

if U .2: 0. 

12 . 6 ) 

Proof . The joint probability density function of X,Y is 
given by 

f (x v) --- , , ttt ex P {-^(x+y+X )> 

' Y /?r(^n)2^ (m+1) 


Hm-1 %n- 1 ~ !X/x) J r (jjfO 

x y L (2j) ir (j +Hm) 

j=° 


From (2.7) the probability density function of F is 


f-(u) = 


- T ( Jh ~) -exp {-^X 2 } 

n/ff 2 ^ m+n) r(Hn) 


“ (X 2 ) j r(j+H) _ ( m u) j+Hm-l ; y >$(m+n)+j-l exp {_i i y(i+S u )}d^ 

j=o (2j)i rtj+f) n 0 


00 


(*sx 2 ) j r (h (m+n) + j) (^u )^ m+j1 

exp{-^X 2 } E ' ’ m v h (m+n) +j 

n j=o r (j+i) r (^m+j) r t^n) ( i-hju)^ 


(2.7) 


(2.7a) 


m 




provided that -n ^ 0 . 

Corollary. Let W = X/(X+Y) . The probability density functio 
W is given by, 

( 2 . 


00 


of 


.8) 


V“> “ k 2 o PkW Sm+kAn 


(oj) 


w bere p R (k) is as defined in (2.3) and ^ 
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probability density function of the central beta random variable 
with parameter (ijm+k^n) , that is. 


r(Mm+n) +k) J$m+k-1 v kn-l 

f w ( w ) = r(?m+kir(W (1-w)^ 

r provided 0 < w 1 • 

The proof follows immediately from Theorem 2.2. ■ The 
distribution (2.8) is called the noncentral beta distribution. 

^ Theorem 2.3. Let the random variable X, distributed normally with 
mean M and variance c , and the random variable distributed 

as a central chisquare with n degrees of freedom, be independent 
and let t — /n X/. The probability density function of t 
given by 


is 


. . n^ n exp{~m 2 } (n+t 2 )~^ n+1 ) 

V ' 


y f ih (n+j +1) ) A j 2 ^ t J 
J= ° J! (n+t 2 )^ ' 


- 00 < t < 


(2.9) 


0 otherwise, 

i 2 2 2 

where X = p /a . 



Proof. Let x 1 = /Kx/a; x 2 = Y/o 2 . The joint probability 
density function of x v X 2 is given by 

1 


X 2 ^ X l' x 2^ - 


• / ST2^ n r(i 4 n) eXP( ' !5nU l- > ^ X) W{-*SX 2 )x| n ~ 1 . 

Uhls the probability density function of t=x ] />'3t 2 is given by 
00 


f t (t) = / -. 


1 1 -^X 1 

"Tn- exp{-i$n(ty / x 0 -^nX) }e 

2^ n P (Jsn) 2 2 


2 Js(n-l) 


o /2n7r 2 ^T (%n) 
i , 


dx. 


exp{-^X z } 2 -4f— / exp{-^x 2 (l+~) }x!* (n+1+ J , “ 1 1 
''i™ 2' sn r() S n) j=o J1 o 2 n 2 


d> 




‘ The aistrib 


(n+t 2 ) * 5(n+1) i i£ r () 5 ( n+ j + 1 )) 

j=o J * 


2^ t j 
(n+t 2 )^ j * 


Q.E.D. 


>ution (2.9) is called the noncentral t-distribution. 
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2 ' 3 ’ sss!?sar distribijtion - quadrat ic form and 

The analysis of variance technique involves the decomposition of 
a quadratic form into several orthogonal quadratic forms. 

In this section we shall discuss some important results 
concerning the distribution of quadratic forms and in particular - 
Cochran's theorem for correlated random variables. Results for 
uncorrelated random variables which are useful for univariate 
analysis of variance will follow trivially from these. 

We will first outline the basic results concerning the 
multivariate normal distribution. The reader is referred to 
Giri (1976) for proofs of these results. 

The random vector X = (x r ...,X N ) • with values in E N 
(Euclidean N-space) is said,to have an N-variate normal 
distribution with mean vector E (x) = y = (y r ...,y N ) • and positive 
definite covariance matrix E = E (X - jj) (X-y) 'if its probability 
density function is given by 


f x (x) = ( 2 tt) 


l\ exp {-%trl 1 {g - y) (x - jj) • }. 


( 2 . 10 ) 


The characteristic function of X is 


E(exp{it'X}) = exp {it' y - \ t'Z t} (2.11) 

N t 

where t e E . Since the characteristic function determines 
uniquely the distribution function, the multivariate normal 
distribution is completely determined by the mean vector jj 
and the covariance matrix Z. 

If X has an N-variate normal distribution with mean vector 
U and positive definite covariance matrix I, then every linear 
combination L'X, L = (..., £^) 1 z E 1 ^, has a univariate normal 
distribution with mean L'j^ and variance L'ZL. The 
converse of this result, that is, if X with mean jj and 
covariance matrix E be such that every linear combination of 
components of x has a univariate normal distribution then £ 
has an N-variate normal distribution, also holds. This fact l ea 
to the following general definition of the multivariate normal 
distribution which covers cases where the covariance matrix 
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singular matrix and as a consequence f v (x) as given in (2.10) 

A “ 

does not exist. 





is 


N 

Definition 2.1 . An N-variate random vectoz X with values in E 
is said to have a normal distribution with mean Jj and covariance 
matrix £ if and only if every linear combination of the 
components of X has a univariate normal distribution. 


According to this definition, if X has an N-variate normal 
distribution then each component X^ of X is distributed as a 
univariate normal variable, so that -°° < E(x^)=y^< 00 and 

Var(X i ) = < °° for all i and hence Cov(X i ,Xj) = < 00 for 

i, j. Thus E ( x) and Cov(x) exist and we denote them by V, £ 
respectively. It is not necessary in this definition that £ is 
a nonsingular matrix. 


From (2.10) it is easy to conclude that if £ is a diagonal 
matrix then the components of X are independently distributed 
normal random variables. Let 


- (X (1) ' — ( 2 ) } ' ' 




-(2)" (X q+l' 


., X N ) 


( 2 . 12 ) 


and .y be similarly partitioned as _y = CJJ(i) ' an< ^ ^ 

be partitioned as 


£ = 


/ E i id ■ 

\ E (21) ’ 


E (12>\ 
Z (22 )J 


where £ 


(ij) 


- E( ^<i) 





Theorem 2.4. Suppose that _X has an N—variate normal 
distribution with mean v and covariance matrix z. Then 
Z (12) = 0 if and only if X (1) ,X( 2 ) are independently 
normally distributed with means # -^(2) covariance 

matrices £(n)'^( 22 ) respectively. 

Proof . The characteristic function of any linear combination 
L'X, L = U r ' e E N , is given by 

4>(t,L) - exp {itL'X) 
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A 

= exp UtL-y - l* t h) . 

Let L = (L (1 ) ^ L( 2 ) ^ ' where t(i) = 

E ( 12 ) = 0. Then 


A )'/ and let 


<t>(t,L) = exp {itL' (j) y (j) -4t 2 L| j) 


T. ... * L , .v } . 
(Jj) -(j) 


Since L is arbitrary, taking t = 1, we conclude that the 
characteristic function of X is the. product of the characteristic 
functions of x^ ^ and X^) and each one is the characteristic 
function of a multivariate normal distribution, and hence the 


theorem. 


Q.E.D. 


Theorem 2.5 . Let X = (X,, . . .,3^)' have an N-variate normal 
distribution with mean £ and covariance matrix £ and let Y = CX, 
where C is a matrix of dimension q x N of rank q. Then Y 
has a q-T;variate normal distribution with mean Cy and covariance 
matrix C E C '. 

Proof . For any nonnull q-vector L e E q , L'Y = (L'C)X. 

By definition 2.1, L'CX and equivalently L'Y has a uni variate 
normal distribution with mean L'C ^ and variance L'C E C'L. Since 
L is arbitrary, Y has a q-variate normal distribution with mean 
Cy and covariance matrix C E c' . 

Q.E.D. 

Theorem 2.6 . Let X have an N-variate normal distribution with 
mean jj and covariance matrix £ and let X, _y, Jj, be partitioned 
as in (2.12). Then 

(i) the marginal distribution of X(i) <3 -var i ate normal with 

mean and covariance matrix ^(n) # 

(ii) the conditional distribution of 2£(2) ^i ven that (1) 

is (N-q) - variate normal with mean JJ^) + ^(2) ^(11) ^*(1) 


) and covariance matrix ^ 22 ) 
Proof. 


(21) L (ll) L (12) ’ 


(i) Since every linear combination of the components of 

also univariate normal, by definition 2.1, jj has the 
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q-variate normal distribution with mean ii(D an ^ 
covariance matrix . . 

(ii) Let us first observe that any vector orthogonal to the 

columns of a ^- so orthogonal to the columns of E ( 12 ) * 


So the linear space spanned by the columns of ^ ( 3 . 2 ) ■'■ s a subspace 
of the linear space spanned by the columns of • Hence there 
exists a matrix A such that 


Z ( 21) " A E (ll) * 

Now by using the property E^ ^ ^n) ^(11) 

generalized inverse ^^ 1 ) ^( 11 ) we °k ta i n 


(2.13) 


E (11) 


for the 


Z (21) E (11) E (ll) “ A Z (11) Z (ll) Z (ll) " A Z ( 11) ~ Z (21)* {2 - 14 

Thus the covariance of X^ 2 ) - H(2) ~ ^(21) ^"(11) ^—(1) ” — (1) ^ 

is equal 


E -( 2 ) - U ( 2 ) £ ( 21 ) r ( 11 ) ( -<l)'-(l) )) ( -( 2 ) _JJ ( 2 ) -E ( 21 ) E (ll) ( -(ll) 

" -tl(l) )} * 

=Z (22) + E ( 21) E (11) E (11) E (11) E (12) ‘ 2E (21) E (11) Z (12) 

=I (22) _ E ( 21) E (11) E (12) ' 
and 

E -( 2 ) “ i±< 2 ) " Z ( 21 ) Z (ll) ( ^( 1 )"^( 1 ) ) ( ^( 1 )--H(i) ) ' 

= E ( 21) " Z (21) Z ( 11) E (U) 

= 0 • (2.15) 

From (2.14), (2.15) and Theorems 2.4 and 2.5 we conclude that 

— ( 2 ) -^( 2 ) * E ( 21 ) E (ll) ( -(l) -)i(i) ) ‘ and * (1) - Ji ( i, are 

independent, and the conditional distribution of X( 2 ) given X^ 

= x (1) is (N-q) - variate normal with mean jj ( 2) + E (n) E^ n) 

(x ,11 -Ml n) and covariance matrix r _y r- T 

-U) (1) r (22) i- ( 21 ) E (H) E (12) ’ 
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In the above theorems, if E is positive definite, then the 
generalized inverse becomes the true inverse. In such cases the 
definition always Guarantees the existence of the probability 
density function as given in (2.10) and readers may find it 
instructive to prove Theorems (2.4), (2.5) and ( 2.6) with the 
help of the probability density function. In the development 
which follows we shall assume E to be positive definite as it 
will be of direct use for analysis of variance. 


2.3.1. Distribution of Quadratic Forms 


Theorem 2.7. Lfet X = (X^, ...,X N ) ' have an N-variate normal 
distribution with mean jj and positive definite covariance matrix 
I. Then X'A* is distributed as noncentral chisquare with k 
degrees of freedom if and only if IA is an idempotent matrix of 
rank k. 

Note . If EA is an idempotent matrix then EA EA = eA. 
Since E is positive definite we can equivalently express the 
stated condition of the theorem as A E A = A. 


Proof . Since E is positive definite there exists a non¬ 
singular matrix C such that E = CC 1 . Write X = CY. From 
Theorem 2.5, Y has an N-variate normal distribution with mean 
v = y and covariance matrix I. Since X'AX = Y'BY, where 

B = C'AC, the theorem will be proved if we can establish that 
Y'BY is distributed as a noncentral chisquare with k degrees of 
freedom if and only if B is an idempotent matrix of rank k. 

To prove it let us first assume that B is an idempotent matrix of 
rank k. There exists an orthogonal matrix 0 such^that 0B0 is a 

diagonal matrix 



where 1 is the identity matrix of dimension k x k. 
= (Z r ...,Z N )' = 0Y. Then 


Let Z 
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Since 6 is an orthogonal matrix and the components of Y are 

independently distributed with unit variance, Z^,...,Z^ are 

independently normally distributed with unit variance. Hence 
k ~ 

£ Z. = z^'DZ^, and equivalently Y'BY has the noncentral 
i=l 1 

chisquare distribution with noncentrality parameter (0_v) 'D(0v) 

= V 1 0 1 D 0 V= V' B V. 

To prove that it is also a necessary condition let us assume 
that Y'BY is distributed as ) . If B is of rank m 

then there exist an orthogonal matrix 0 such that ,060* is a 
diagonal matrix with nonzero diagonal elements X^, ..., X m , the 
characteristic roots of B. We can assume without any loss of 
generality that the first m diagonal elements are nonzero. 

Writing Z = 0Y, 
m y 

Y'BY = £ X.z . 

i= 1 1 1 

2 

Since , i = 1, ...,m are independently distributed as noncentral 
chisquare random variables each with one degree of freedom and 
Y'BY is distributed as it follows frorti the uniqueness 

of the characteristic function that m = k and X. — 1, i = 1, ...,m. 
In other words 0B0< is a diagonal matrix with the first k 
diagonal elements equal to unity and the rest are zeros. This in 
turn, implies that B is idempotent. 

Q.E.D. 

Theorem 2.8 . Let X have the N-variate normal distribution with 
mean and positive definite covariance matrix £ and let 

X' £ _1 X = X'AjX + X'A 2 X, (2.16) 

where X' A i2 £ is distributed as noncentral chisquare X^(^) with 
k degrees of freedom and the noncentrality parameter X^. Then 
X'A 2 x is distributed as noncentral chisquare with N-k degrees of 
freedom and the noncentrality parameter y'E - ^j -X^. 

Proof. Let C be a nonsingular matrix such that E = CC' and 
let Y = c X* Obviously Y has the N-variate normal distribution 
with mean _v = C jj and I as the covariance matrix. Now 

Y'Y = Y'B-jY + Y'B 2 Y , 

where B^ = C'A^C, i = 1,2. Since C is nonsingular 
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rank (A.) = rank(B..) i = 1# 2 . 

The t h e orem wlll be' proved if we can show ^ Y . B y ^ 

ha th ^ ^ ^ n ° nCentral xi k <U' r\ - xh: since Y'B v 
has the noncentrai chisauare v 2 /i 2 i K .T 1 - 1 - 

conclude rw = niSqUare X k (Xfi, by the above theorem we 
a B x is an idempotent matrix of rank k. But 

and 


(I-B^ (I-B ) = i_ ?B + B j_ E 

± , 1 1 1 x a i’ ( 2 . 17 ) 

H_^ Ch 1 ^ lles that *~ B i is an idempotent matrix of rank N-k 
ce has the noncentral chisquare distribution with N -k 

greeo of freedom and with the noncentrality parameter 

—' Cl-PjJv = v.'v - v_' B 

= £' Z \ - JJ'C ' -1 C'Aj^CC' 1 ]! 

= ji' Z~\ - H'Axk . 

T h eorem 2,9 . Fisher-Cochr an's Theorem . Let X have the' E ' D ‘ 

N-dimensional normal distribution with mean ^ and positive 
definite covariance matrix E and let 


X' Z X = Q 1 + ... + Q k , 


( 2 . 18 ) 


where = X'A.jX, with rank ( = N., i = 1. k . Then Q.-s 

are independently distributed noncentral chisquares v 2 (u'A.u) if 

k k A N. 

and only if z N.=N, in which case u'Z _1 u = Z y'A u 

i=l 1=1 1 ~ 

Proof. Since Z is positive definite there exists a 

nonsingular matrix C such that E = CC' . Let Y = C _ 1 X.’ Then 

/ n i n \ *i 


(2.18) reduces to 
k 

Y'Y = Z Y'B.Y 
i=l 1 


r 2.l9) 


where B^ C’A^C, and Y has an N-dimensional normal distribu¬ 
tion with mean v = C -1 ^ and with I as the covariance matrix. 

Let us suppose that Y'B^Y, i = 1 . k are independently 

distributed as x N . Then £ Y'B.Y is distributed as a 
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noncentral chisquare with z NL degrees of freedom and the 

noncentrality parameter Z- v'B,v. Since Y’Y is distributed as 

2, , \ and (2.19) holds, we conclude from the uniqueness of the 
X N v - “ n k 

characteristic function that Z N. = N and Z = 

\ _ 1 i= 1 i=l 

that is, I E U* 

i = l 

To prove that these conditions are sufficient let us assume 
k 

that 2 N. = N. Since Y'B ± Y is a quadratic form in Y with 
rank(B^) = N^, using the arguments of Theorem 2.7 we can write 

N. „ 

Y'B.Y = l X + Z . , i = 1, ,k, (2.20) 


— l— 


“ iJ 


where Z^'s are linear combinations of Y^,...,Y N . Let 

Z = (Z 1]L , ,Z 1N , Z kl , Z kN )'. Then from (2.20) we get 

1 k 


Y'Y = Z' A Z 


( 2 . 21 ) 


where A is a diagonal matrix of dimension N * N with diagonal 
elements +1 or -1. Let Z = AY be the linear transformation 
which transforms the positive definite quadratic form Y’Y to 
Z'AZ. Since for all values of the vector Y 


Y'Y =Z'AZ=Y'A' A AY, 


we conclude that A'A A = I. In other words A is a nons^ingular 
matrix of dimension N * N. This implies that Z' A z is a 
positive definite quadratic form and hence A= I, A'A = I. Thus 
A is an orthogonal matrix and hence Z has the N-variate normal 
distribution with I as the covariance matrix. Hence from 
(2.20) we conclude that Y' B^Y, i = l,...,k, are independently 
distributed x^j (_v'B._v) - But Y'Y is distributed as . 

l 1 k 

H ence v’v = z v'B.v, that is, jj' Z _1 y = I y'A.^. 

i=l 1 i=l 1 


Q.E .D 
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Theorem 2.10 . Let X have an N-variate normal distribution with 

mean and positive definite covariance matrik I, and let the two 

quadratic forms X'A^X'A^X be distributed as (y'A^) and 

2 1 
X N (U , A 2 P) , respectively. Then X'A^, X/A 2 X are independently 
2 

distributed if A^ E A^ = 0. 

£roiQi. Using the notation of the above theorem we write 
X'A ± X = Y'B.Y, i = 1,2, 

where B i = C'A.C. Since Y 1 B i Y, i = 1, 2, are distributed as 
noncentral chisquare with N^ degrees of freedom we conclude from 
Theorem 2.7 that B^, B 2 are idempotent matrices of ranks N^, N 2 
respectively. Hence B^ = B^ i = 1, 2. Thus 

B 1 (I-B 1 -B 2 ) = (I-B r B 2 ) B 2 = .0 
if B^B 2 = 0. This implies that 

rankCB^) + rank(B 2 ) + rank (I-B 1 -B 2 ) = N. 

But 

Y'Y = Y ' B^Y + Y 1 B 2 Y + Y 1 (I-B ^-B 2 ) Y . 

By the Fisher-Cochran's Theorem the quadratic forms Y'B^Y, 

1 B 2 Y, Y* (I-B^-B 2 )Y are mutually independent. 

Note that B^B 2 = 0 implies A^EA 2 =0. Q.E.D. 

Theorem 2.11. Coc hran's Theorem . Let X a = (X ..._ X ) «. 

- — a 1 ap ' 

a = 1,...,N , be a random sample of size N from a p-variate 
normal distribution with mean £ and the same positive definite 
covariance matrix E.- Assume that 

N 

l Of) -x“= Q + ... + Q k , (2.22) 

01=1 1 

where 

N 

Q ± - E (X 01 ) -* a* X* 3 , i = !. k 

a, 0=1 

and A. = (a^) is of rank N... Then Q..’s are independently 
distributed as 
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N x +. . .+1T 


‘(z a ) 1 (z a ) 


(2.23) 


a=N .. . +N^ 1 


where z“=(Z..Z ) ' , a =1 . £ N are independently 

- al ap i=l 

distributed normal p-vectors with mean £ and’ covariance matrix 

k 

E, if and only if E N. = N. 

i=l 

Note . If p = 1, this gives the popular univariate version 
of Cochran's Theorem. 

Proof . Let Q.'s be independently distributed as in (2.23) 

1 

Then I Q. is distributed as 
i=l 1 


k i 
E E 
i=l oi=l 


E (Z a ) 'Z a . 


Using (2.22) and the uniqueness of the characteristic function we 

k 

conclude that £ N. = N. 

i=l 1 


To prove the sufficiency of the condition let us assume 

that E N. = N. Using the argument of the Fisher-Cochran 
i=l 1 

theorem we assert that there exists 


such that A i = and is of dimension N i x n. Let 

B = (b a g). Since B is orthogonal, 

a N o 

Z = £ 

6=1 “ B 

are independently distributed p-vectors, with mean 0 and 
covariance matrix £. Clearly 


Qi = Z (X ) ' a aft X 6 

3=1 


N.+...+N. 

1 ^ 1 ,(z a )' (Z a ) 

a=N 1 +...+N +1 


Q.E.D 


44 ANALYSIS OF VARIANCE 


Example 2.1. Let X^,be a random sample of si?e n 
from a normal population with mean y and variance o^. Let 

X = (x r .. .,X N ) ' , 



Observe that A^, A ^ are idempotent matrices of rank N-l and 
1, respectively. Furthermore A.^ = and 


- 2 -2 

X’-X = E (X.-X) + NX 

i=l 1 


= X’ A 1 X + X 1 a 2 X . 

N - 2 2 — 2 

Thus, E (X.-X) /a is distributed independently of NX as 

i=l 1 

X* _. Note that E(X , A 1 X) = 0. 

N-1 — I— 

Example 2.2. Let X^ ^ , j = 1,...,N^, i = 1,...,k, be a 

random sample of size N. from the ith normal population 

1 2 

with mean y^ and the same variance a . Let 


X- (X n ,...,X 1N , X 2 i» . . • / X 2 n '•**' *kl' * ‘ ^N. ) ’ 

N 1 ^ X 

, i 1 X n. k 

X. = 77 - E X.. ,X = t: E E 1 X. . , N = I N. , 

1 N i j-i « •• N i-i j-i ^ i=i 1 


and let the matr-i-ces A^jA^^^# each of dimension n * N » be 
given by 


A 3 = 


l 2 

N ' # * " N 


a 2 = B - a 3 / a 1 = I-B , 
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where B is a diagonal matrix in the block form with diagonal 
elements B / ..,&^ k ) * B (ii) ^ e i^g the N i x ISL matrix each 
of whose elements is equal to 1 /N i and I is the identity 
v matrix of dimension N x n. It is easy to verify that 
A 1 ' A 2' A 3 are a H idempotent matrices and rank (A^) = N-k, 

/rank (A^) = k-1, rank (A^) = 1. Furthermore, A^A. = 0 for i ^ j 

j 


k N 9 k 9 

X'X = Z Z 1 ( X. ,-X, )* + Z N. (X. -X ) 2 + N X' 

i=l j = l lm i=l 1 1 * . '* 


= X'A X + X'A X + X'A^X. 


Noting that 


EtX'AjX) =0, E(X'AX) = Z N (y-y) 2 , 
, i=l 1 1 ’ 

E(X A. X) = Ny, 2 , 


where y. - Z N.y. / N,we conclude that X'A.X and X'A_X are 
i_l i i i z 

independently distributed as x^_ k and X k-1 ( Z N ± (y.-y ) 2 ) , 

recnpr't i vp! v 1=1 


respectively. 


2.3.2. W.ishart Distribution and G eneralized Var-ianne ; Let X Q 
i = (X a ^ # • • • • » a=: 1, ..., N, be a random sample of size N 

from a p—variate normal distribution with mean and positive 
definite covariance matrix Z and let 


N . N 

s = z (£ - x) (x a - X) ' , X = ^ Z x a . 

a=l a -1 

For given sample values x a ,a= 1,..., N, 


(2.24) 


s 1 _ a , a 1 a 

N = \ 1 (* - 25) ( x - x ) ', x = - 1 x 

1 a=l a= 1 ” 

are the maximum likelihood estimates of Z , ^ respectively. 

Theorem 7,1? (i) v^N X is distributed independently of S as 

P-variate normal with mean v^N ^ and positive definite covariance 

^trix Z and S is distributed as 
N-l 
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where Z ,ql - are independently distributed p-variate 

nornial vectors with the same mean 0^ and the same covariance 
matrix E ; 

(ii) S is positive definite with probability 1 if and only if 
N > p . 

E TP . o f » (i) Let 6 be an N x N orthogonal matrix of the 

following form 



and let 


- e =i e “^ 6 ' a= 1 . N ' 

cx 

with 6 = 1/vft for all 6 . The set of random vectors Z has 

a joint normal distribution because the entire set. of components 
is a set of linear combinations of the set of the normal random 
vectors X a , a= 1,...,N. Now 


E(Z a ) = 


ft jj, if a = 1 

N N 

E 6 0 m = »fty I 6 Q —~ 
5=1 6=1 


= 0, if a > 1, 


cov(z a ,z Y ) = e e e e(x b - jj) (x Y - jj) 1 

U__^ otp Yp 
'0 , if a / y 

= < 

^E , if a = y . 

Thus Z 01 , a = 1, ...,N are independently distributed normal 
p-vectors with means and covariance matrices as given above 
Since 0 is orthogonal 









BASIC STATISTICAL CONCEPTS 47 


E Z a (Z a ) 


i x a (xV 


Thus 


s = i z a (z a ) ' , v/N X = Z-i 
a=2 

are independently distributed. 


(ii) Let 

B = ( Z 2 , ..., Z N ) . 

Then S = BB' . Since B is a random matrix of dimension 
p x (N-l) , this part will be proved if we can show that B has 
rank p if and only if N > p. Obviously by adding more columns 
to B we cannot diminish its rank and if N < p, then the rank 
of B is less than p. 

Thus it will be sufficient to show that B has rank p with 
probability one when N-l = p. For any set of (p-1) p-vectors 
oi*,..., a p “* in E p let S (ot*, .. ., a p_ *) denote the subspace 


1 1 

spanned by ot , ...., a p_ . Since the covaria 

^ pl 

is nonsingular, for any op in E p . 


Since the covariance matrix of each Z 


, Prob { Z a e S (a a p *) } = 0 for a= 1 .p. Sii 

Z ,...,25 p are independent and identically distributed 

Prob {£*,...,JZ P are linearly dependent) 


< E {Prob Z 1 e S (Z 1 , ...,Z 1 "*, Z i+1 .Z P )} 

i=l 

= p Prob (z 1 e S(Z 2 ,...,Z P ) } 

= p E [Prob {Z 1 e S(Z 2 .Z P ) | Z 2 = Z 2 .Z P = Z p }] 


= p E (0) = 0. 


Q .E .D. 


Thq distribution of S is known as the Wishart distribution 
with N-l degrees of freedom and with pararreter E . Its 
probability density function W (N-1 # E) (see Gir'i (1976), for 
a proof) is 
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r 


w p (N-i,z) = - 


K 


|s^< N -P- 2 > exp^trz^s} 
S is positive definite, 



where 


^0 , 


if 

otherwise. 


(2.25) 


K_1 = n T ( ^(N-l-.i) ) 2 il(N ~ 1) P ^(p-D/4 


P -1 


1=0 


(2.26) 


vLlJ U E 13 3 SCalar qUantit y and s is a scalar random ' 
variable, and W p iH-i.S, reduces to the probability density 
unotion of j times a central chiSquare random variable with 

mult- e9rees ° f freedom - The Wishart distribution is thus the 
multivariate generalisation of a central chisquare distribution 

varA P aYS the Same central role as that of chisquare in multi- 
late analysis of variance and covariance. 


The characteristic function of s is given by 

-h (N-1) 


(2.27) 


relate! ° f ** F " StatiStic which is 

related to the ratio of two independent chisquare random 

variables plays a.dominant role for testing various hypotheses in 

inear models. In the multivariate case, the ratio of the 

determinants of two independent Wishart matrices plays- a similar 
dominant role. 


E(exp {i tr 0S }) = |l-2i £0 | 

where 6 is a real symmetric matrix of dimension p x p . From 
(2.27) it follows that E(S) = (N-l)E . Thus s/N is not an 
unbiased estimate of I and the unbiased estimate of s is 

s/(N-l) . Furthermore, if s x .S k are independently distributed 

Wishart (random) matrices with N i - ‘ 

i' 

freedom and the same parameter Z , then 

k i=l 

Wishart with E N, degrees of freedom and the parameter i 


degrees of 

E S. is distributed as 


i=l 


For a p-variate normal population with covariance matrix E , 
Wilks (1932) called | E | the generalized variance of the popula¬ 
tion. For p = 1 it reduces to the ordinary variance and for 
P - 1 i1: isa measure of the internal scatter in the population, 

pplications it is often used to rank several distinct 
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mu ^ t ^ var ^ a ^ :e populations in order of their internal scatter, 


.a 


Similarly the generalized variance of a sample . x“,a = 1, 
from the population is given by 


* 


N 




I— 

'N-l 


E (x -x) (x“ - x) 
<x=l 


N-l 


N 


where x = ^ 2 x a . 

a=l 


Theorem 2,13 . Let S be distributed as W (N-l.Z). Then |S 

.... o 'i n 


is distributed as | E | ^ X^_ ^ • where 


i=l 


X 


■N-i 


P 

, 1 = 1 , 


• Vi 


are 


independent central chisquare random variables. 


Proof Since 2 is positive definite there exists a 
nonsingular matrix c such that C EC* = I. write S* = CSC'. 


Since S is distributed as W p (N-l,Z) we can write S as 


N-l 

S = 2 Z a (Z a ) • , 

a=l 


where Z a , a- 1.N-i are independent normal p-vectors with 

mean £ and covariance matrix 1 . Now 


; 


s* = 


N— 1 n~ 

z (CZ a ) (CZ a ) ' = I Y a (Y a ) , 


a -l a=l 

I where Y - CZ , a= 1,...^-!, are independent normal p-vectors 
I with .nean 0 and covariance matrix I. Thus s* is distributed 
as W p (N-l,I) and its probability density function (from 2.25) i 


Kls*!* 4 <N-2 -P> 


exp {-^tr s * }. 


Partition S* = (s*. .) as 


(2.28) 


u 


1J 


s* = 


(ID 


( 12 ) 


c * 

\ S (21) 


PP 


Since s is symmetric and Is 

i V ' 


A . PP' (11) (12) pp ®(2 

i e -’ oint Probability density function of S* w s* w s* is 

4 l 11) ( 12) pp 


- S* |c* * *~1 * 

S PP |S (11) - S ( 12 ) S PP S 

★ 


( 21 ) 





jj(N-p-2) e xp { -*^ s pp 


«l*ui) ' S(12) pp 


-X * 
* ' s ,, 


H(N-P 


_n- 2) 


S_ s ( 


exp 


{.!5tr(&*ii)- s (12) PP (21> 


jxpt-Hls^fpp 1 S A21> ) > * 


s* , s*' 1 s?,,\ • 

" b ( 12) PD ( 21 ) 


W * S 01) " (12) PP 


u = S 


(12) PP 


(S*~ H being the positive square root of S pp ) 


v * s . 

pp . . 

,c* A* S* ) + (W,U,V) IS 

The Jacobian of the transformation (S ( n) (12) ' pp 

given by the absolute value of the determinant of the following 
matrix of partials 
* / w u v \ 

■P 1 ’ / 1 - M 


where indicates some matrix which need not be evaluated 

specifically and A is the matrix of partial derivatives of the 
transformation 


( 12 ) 


-*■; u v* (v fixed) . 


( 2 . 2 ! 


srnce the Ja cob ian °f the trana^rmation (2.29) is | v| 4(p-l) 

the joint probability density f unction-of w u V ■ 

K ,»,>i(N-2> V. , ' ' U ' V 13 ^ven by. 

* (V) exp {-Jjv} 

x| w | J l(N- 2 -(p_ 1 )_i) 

* e *P -^tr'w) 

x e *P' {-^tr uu 1 }. 

* 0 * this we conclude that ( 2 . 

ndependently 0 f w = „ PP* ls distributed as w (n 11 

as w o * 8 Mu - S e *-l * W (n_ i) 

P -1 “ #I p-i^ where i PP S (-21) axid w is distrib 

is the ... 


P-1 the identity matrix of 










fSiffiasP 1 ■' J 

’ -• -v>. 'v 
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dimension (p-1) * (p-1) . Noting that W^N-1,1) is a central 
chisquare with N-l degrees of freedom we conclude that 
JWpfN-l^p) I * s distributed as the product of ± and 

Si |Wp_ 1 (N - 2 ' I p-l ) ^ where x N _ 1# w p _i(N-2,I are independent 

Rep6 at -‘- n ^ above argument p-l times we conclude that jS* | 


is distributed 

independent. 


2 

X N-i ' 


where 


%-i 


i i 1 1 «t•/p t are 


Q.E.D. 


2 

Hotelling's T -statistic, a multivariate extension of the 
square of the Student-t statistic, is fundamental in testing the 
hypothesis of equality of the mean vectors of multivariate normal 
populations with unknown but equal covariance matrix and plays the 
sane role in the multivariate case as the Student-t in the 
univariate case. We shall now discuss the distribution of 

•y 

T -statistic in the general setup. Let X a = (X X ) ' 

“ al'***' ap 

a = 1/ .../N be a random sample of size N from a normal 
population with mean jj and positive definite covariance matrix 
K I. Let 

| T 2 = N(N-l) X'S _1 X , 

f where X,S are as defined earlier. 

Theorem 2.14 . t 2 /(N-1) is distributed as y 2 (N u' I -1 u)/v 2 

2 2 P -= A N-p 

where x p (*) and X N p are independent. 

Proof . Let C be a nonsingular matrix such that CZC'=I 

and let Y =/N C X , A = CSC' and v =^Tcjj. Then Y has a 

p-variate normal distribution with mean v and covariance matrix 

N— 1 — 

! I and A is distributed as Z^ Y a (Y a ) ', where Y a , a= 1,...,N-1 
ate independent normal p-vectors with mean 0 and covariance 

matrix I. 

Consider a random orthogonal matrix Q. of dimension p x p 
whose first row is Y'(Y'Y) - ^ and the other p-1 rows are defined 
arbitrarily. Let 


.d U * (u r--^ U )'• 
A P 
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B = (B ±j ) = QAQ 1 


It is easy to check that 


U 1 = / 


“ 0 , i — 2 ,..., p, 


and 






Hi s" 1 X = Y' A" 1 Y 


= u 1 a -1 u 

= U l /<6 11- B {12) B (22) B (21) } 
wnere b is partitioned as 


B = 


B11# B (12) 


V B (2D ,B (22) 


r:- 8 —.« 

computation aimii ar to (p to, „ d Ehe Potacter I, by 

“< “ “ °°” 1 “ d * «. ionaitioLl 

; l6qUare "“h N-P ^greei ‘ife 0iWn Q is the central 
this conditional distt-K eedora (Wishart w (m 

*;p d “o Z 

Mr 9 " 663 ° f ' By^' 21 ’ * ls ° a <=ent 

and noncentraHty entral ChiS<I ' Uara with p ^ 2 ‘ 7 ' 2’i is 
; h nde — of y iy H7"l er NP ■ rl / reeS ° f tree don 

theorem. - - and B B -1 B The ' 

(12) '22) B (21) 1- el ear 

2 ‘ 3 - 3 - Cow., .. • Hence t 


2.3.3. Com 1 - Hen ' 

rand ° m . q 

«^zni 7777 ^^ a w- 

2 = (2,.! atribUtton - A p-vari ma91nary ' Parts nor »al 

^stributi'* 2p) ' with Z - v at ® Com Plex rand Ve a J °int bit 
*-va r - U the -al J 2 ; J + »j E j . * * '"*<*• 


^triC-'V’ with , = 7 ate t°mp lex 7 ve ^ joi nt 

0 ion if j_v J + jv n ^om ver>+- 

^tariate „ the rea l 2n „ J j' J = l fc or- 

E( 2 ) _ n °tmal distriK P V Ct ° c ( x '’••■P, has 

V '5 ana He* . ribu tio n . T ... y as a n, 

~ £ ' “here (2 " Utian C0I »Ple x ~ be a P c V "" Y r?' 1 

* - a)* , ^ ex c ovari=, * Coitl Plev p 

■ ih ' * d Joi,t o ;r *“«» *< 8 ? 

f ( 2 - a) \ - - 2.) C 

s probab; 
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density function is given by 

f (z) = (27r)" p |z| _1 exp {- (z-a) * E _ 1 ( < z_a) } . (2.31) 

The special case with the added restriction that 
' E(Z - a) (Z - a) ' = 0 

is of considerable interest in practical applications- This 
condition implies that the real and imaginary parts of different 
components are pairwise independent and the real and the imaginary 
parts of the same components are independent with the same 
variance. With the above'probability density function one can 
establish in a straightforward way that, as in the-real case, if - 
Z has a complex p-variate normal distribution with mean a and-- 
Hermitian positive definite covariance matrix I, then CZ^, where 
C is a complex nonsingular matrix of dimension p x p, has a 
complex p-variate normal distribution with mean Ca and Hermitian 
positive definite covariance matrix C L C*, and any subset of the 
components of Z has a complex multivariate normal distribution. 
Furthermore if the covariance between any two subsets of the ~ 
components of Z is 0 then these subsets are stochastically 
independent. Other results related to analysis of variance 
(complex case) will be included in the exercise as problems. 

2.3.4. Multinomial Distribution : For the analysis of variance 
of discrete data in the form of observed frequencies in discrete 
categories we need to outline the multinomial distribution. Let 
X = (X^,..., Xp) 1 be a-discrete p-dimensional random vector with 
probability mass function 




~ 11 p i ' if 0 * x. 

p* i = 1 i 

for all 


(JD , otherwise, 

p 

where p i £ 0, subject to the restriction that l p 

P 1 = 1 

and l x. = n. It can be verified that 
i = l 1 


z n 
i, 

(2.32) 


1 . 


E(x i ) = n Pi , Var(x i )=np i (l-p i ) , 
Cov(X i ,X J .) = -np i Pj (i j* j) . 
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The marginal distribution of (X^ f ...#X ) '/ q * p is given 


q 


y t • • • § ^ ) = —r Y | /_ n ~v. n p. (1“P-i“ • 

> i / •« • / a l q X 1 . * •, x m \ n— n ) \ - i 1 

J- q , 1 q oi=l 


n-n 

-Pq> < ° 

(2.33) 


where Z x. = n . 


1 o 


„ _ x i - ^ 

^i n • 1/ ... /P- 

✓n 

Then# as n ■+• ® , ( Y , ..., Y ) 1 tends to a multivariate normal 

i p-1 

distribution. 


2.4. INVARIANCE IN STATISTICAL TESTING 

Invariance is a mathematical term for symmetry and in practice 
many statistical testing problems exhibit symmetries with respect 
to certain transformations in the sample space. It is a generally 
accepted principle that if a statistical problem with a unique 
solution is invariant under certain transformations then the 
solution should be invariant under those transformations. 

Example 2.3. Consider the problem of testing the hypothesis 

that the mean of a univariate normal population with unknown 
2 

variance a is equal to a specified constant y . Since y = y 

O ' Q 

implies that for scalar constants 'a(/ 0), b, ay + b = a y + b 

o ' 

we should expect that our test procedure should depend neither 
on the scale of measurements nor on the origin of measurements. 
For example if the measurements are lengths, our test procedures 
should not differ whether the measurements are in inches or 
centimeters and whatever may be the origin of measurements. 

Let x be the sanple space and A be the o-algebra of subsets 
of x which contains the null set, the whole space and is closed 
under complementation and countable union. Let Q the parametric 
space and let p denote the family of probability measures 
p 6' ® e a on A . we shall be concerned here with the problem 
of testing the null hypothesis H Q :0 £ ^ against the 

o 
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alternatives e 

of 8* 

'* The principle of in 


where ^ and % 

A J- 


disjoint 


subsets 


for testing problems inv 
The principle °f invariance for 9 

^ansforsmtions -inly on the sample spa e * »* ^ ^ 

^parametric space Q . Among these the mo 
. rpug transformation on ft i s 

transformation g on * . ™ A11 transformations 

transformation g on ft , indue Y assumed t 

considered in the context of invariance will be assumed 

the following properties: 

(i) one-to-one from x onto X/ that is, for every x i e 


involves 


to have 


there 


exists x 2 e x 


Ji 

such that x 2 = gfXj) and gtx^ = g<x 2 > implies 


that 


X 1 * X 2* 


(ii) bimeasurable: this is merely to ensure that whenever X with 
values in x is a random variable, g(X) , usually written as gX is 
also a random variable with values in x and for any set A e A 

gA = {gx:x e a} and g~*A both belong to A . 

The induced transformation g on ft corresponding to the 
transformation g on x is defined as follows: 

jfl? if the random variable X with values in the sample space x 

has distribution Pg, 0 e , then gX with values in x is a random 
variable with distribution P 0 ,, where 0' =,g 0 e ft This is 
usually known as the condition of invariance of the family of 
distributions {P , 0 e ft } . 

An equivalent way of stating this is, for every A e A 


P 0 (X c B' X A} = P e , {X e A} 
which can also be written as 


p 9 . {g'Aj = p , {A} 


thl” k 9 * 15 thS iI>VerSe t^sformation corresponding to g and 

«> e ; is rr. e in the ieft raernber ° f <2 - 34) and (2 - 35 > 

'a 1 " the right T rand ° m Variable X and the ^script 0.=g 0 

V the ° f -aicates the distribution 


(2.34) 


(2.35) 


the 


random variable gX. It is now evident that ( 2 . 


35) can also 
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be written as 


P 0 (a} = p 0 , {gA} . (2.36) 

In terms of mathematical expectation (2.34) or its equiva¬ 
lent can be stated as follows. For any integrable real valued 
function 


-1 


E ft (<|>(g X)) = E- Q ((j> (X) ) , 


(2.37) 


where E Q denotes the expectation of <j> when the random variable 

X has distribution P 0 and E- n refers to the distribution P- 
of x. 


ge 


go 


If, in addition, all P Q ,0 e £2 are distinct, that is, if 


0 i * e 2 # e x e 2 


e n , implies that P ^ P then g determines 

«1 0 2 


9 uniquely. In other words for any 6 , ge is unique. To see thi< 
let for A t A , let 


V A) = p 6 .(sa> = P 0 „(gA) 


for 0 . 8 ', 6 " belonging to SI 
e'\= e". 


Since all P„, 6 e n are distinct. 


0 


On the other hand if (2.3i> holds then the unicity of 0 . 
es that all P 0 e Si are distinct. 


implies that all t 

0 


Invariance of the paramet-^ - r — a _ he 
remains invariant under a one-to-on 7 Transf Parametric s P ac 
operating from the left, on x onto x if th , l0n ' 9 
9 on Q satisfies (i) g 0 e ^ f „ e ln uced transformati 
«>ere exists a 0 e 0 such ^ ! -! -y 0 . e fl 

stating (i) and (ii) is g • An equivalent way of 


gQ = n. 


Given a set s of ' tranef 

following theorem Will assert^hT' leavin 9 « inv. 

ended to a group G whose members 1 lT* ^ S Can be 

transformations of G wiu also leave a invar 

^measurable if the member of s r6f ° re ' one-to-one a 
considerable ^ s are So 

theoretic ^ ance to us as w e ca * * result will 

«oreti c results for- 4.x. can use wen v 

the <3evelopment of th ° Wn 9r ° U: 

the invariance pri 




3 


Theorem 2.15. Let g.,g. 
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i 

be two transformations operating from 




' ij 


17) 

6 


' l '*2 

the left on x onto x, leave £7 invariant. Then the transformations 

g l g 2' g l _1 defined b Y 

g 1 g 2 (x) =g 1 (g 2 (x)), g^g^x) = x 

for all x e x also leave ft invariant and satisfy 

-1 
1 

Proof . If the random variable X with values in x -has 
distribution Pq, 0 e ft , then the transformation g^X has - 


g l g 2 


= g x g 2 


(g x 1 ) = g- 


(2.39) 


distribution P- Q , g.0 e ft 

g x « l 

ft 


Since g^0 e ft and g 2 


leaves 


invariant, the distribution of g 2 g 1 (X) = g 2 (g^(X)) is 


Us, 


'g 2 5i 6 Pg 2 g l 0 


where g 2 g^0 e ^ for 0 e ft . Thus g 2 g^ leaves ft invariant and 
Similarly (ii) can be proved. 

Q.E.D. 


g 2 g l g 2 g i 


=t, 


ce 

ion 

ft 


Let G = (g:g £ G>, where G is the group of transforma¬ 
tions operating from the left onto itself. Since for g^»g 2 in 
G (2.39) holds, we conclude that G is closed under composition 
and inverse. Furthermore if 0 is the identity element of G, 0 
is the identity element of G. Thus G is a group of 
transformations on ft corresponding to G on x and (2.39) 
states that the correspondence between G and G is a 
homomorphism. As stated earlier if all P Q , 0 e n are distinct, 
g e G is also one-to-one onto itself. 


Let us now consider the problem of testing H Q :0 e 


38). 

a rtf' 

it* 


against the alternatives 


: 0 


e fin where , Qj, are 
1 o 1 

disjoint subsets of ft. Let G be the group of transformations 

which operates from the left on x and satisfies (2.34) and 

(2.38). The problem of testing H Q against remains invariant 

under Gif for g e G, A e A 


p e ( 9~ 1 A> = V A) 

and 


(2.40) 
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\ = \ = °\ 


(2.4] 


^pple 2.4. Let X^,...,X n be a random sanple of size Ji 

from a normal population with mean y and variance o 2 with 

corresponding sample values x v my.,, i 

f uet, x^, ., . The sample space x 

° f X 1. X n is the space of all sample values (x,,...,x ) 

Let 1 n 


n 


x = , E x 
n . , i 
i=l x 


n 


2 1“ o 

S = n 2 <V 5) ' 

1 = 1 


n 


X = n 2 x - 

n i=l 1 


n 


S " = n ( V X) ^ 
i=l 


6 = (y, a 2 ) 


The parametric space is Q ={e = (u,o 2 ) , - < „ < .. 0 2 > 0} an 

let Qtt = ffft.rrA „ 2 ^ , , O o 


let ! 2 Ho = { to, a 2 ) , a 2 > 0}, j> = {(p,p 2 ): „ * o, s 2 > o}. The 

group g of transformations g ihich leaves the problem invariant 
is the multinlicat 1 Vm nrmm ^ -P „_ t _ _« 


= -—. vco uuc prociem 1 

is the multiplicative group of scale changes x ± - ax., 

1 = l,...,n, where a is any nonzero real. The induced’ group 5 

on 0 is the group of transformations g operating from the lei 

on 0 such that ge = (a u ,a 2 a 2 > . obviously gn-fl £or all 

g s G. AS for any A S A , g e G. X = U,...,* , e ,-1* impli6 

that gx e A, we have with y. = ax i - i 

J i j/ J- - l, ... , n, 

_- J 5n,„2,-4n " ' 2 

g‘A 


P {g\A} = / ( 2271-) (r?-\ ~% n r 1 n ^ x -i“V>) 

6 q A exp {-ij Z —i_-}dx ,dx 

y i=l -1 n 


r(')ir\~% n i 2 2\-^n f n (Yc-ay) 
- / ( 2it) (a o ) ^ exp {-^ I - 1 


i=l a 2 g 2 


■}^r, ... dy 
1 n 


P ge^ * 


Also 5 . 5 ^ - n Hi 

from ^ X l' * ’'' X m be a random sample of size 

let y population with mean ^ and variance o 2 and 

a . ra ? d ° m Sample ° f » *P. another 


. . * --* n rrom another 

independent normal population with mean u an , „ . 2 

liie sample space of (X,,...,x Y Y . 2 . arian ce a . 

, 1 m' Y 1 ' • vis the space of 

sample values (x.x . v , n . Pa e ° £ 







tSb&PP 

-if 


X = ^ * X. , y _ 1 n "' ' a i | CAl C( 

m i=l 1 Y ~ n .f y., s 2 = i m 

. in m i=l (x 1 -x) 2 , s 2 = 1 " . . 2 

x = i z x , 5 = I « • . 2 " i . 
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m . - 

i = l 


i' = n Y s ; s 2 Ji 

1=1 1 1 ~ ^ 


e = ( y 1# \ 1 2 , a 2 ) . 


. Z tX.-x) 2 s 2 - 1 ? ' ? 

i=l 1 * 2~ n z CY. _?) 2 

i=l 1 


The parametric space a is 


1 AS given by 
fi = {0 • ~ 00 < y. < oo i 

1 * 1 = 1. 2, o 2 > o j 


For testing the hypothesis H , „ 

alternatives H • u d , O’ 1 v 2 ~ u tsay) against the 

1 1 ^ y 2' we get. 


{(Ms cr), ■“ < n < ” > 0 }, 


{ (y 1 »y 2 ,o ), y x ? u 2 , a 2 >o) . 


The group G of transformations which leaves the problem invariant 
is the group of affine transformations (a,b), with a t 0, which 
transforms each x i -^ax. + b and each y i -*• ay i + b. The induced 


group G is the group of transformations g such that q0= lap 

2 2 1 
+ b, a y + b, a a). As for any A e A and g eg, (x,, ..., x , 
z _ 1 1 m 

y^, . .., Y r ) £9 A implies that 


(ax 1 + b, .. .,ax m + b, ay x + b, .. .,ay n + b) e A 
*e have with = ax. + b, v i = ay^^ + b, 

m 

>( g 1 *) = ' (2*)- h(m+n) (o 2 )- h(m * n> exp{- J-( I ( x ) 

0 gA 20 1 


P( g 


+ I (y j-y 2 ) 2 ) ) 
i=l 


m « 

n dx. n dy. 
i=l 1 i=l 


m | 

(m+n) (o 2 )' Mm+n) ex p ( - iv (l iii (vaUl ' b 


= / ( 2tt ) 
A 


+ l (v -ay 2 -b) 2 )> 

i=l 


m n 

n du. II dv. 
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= V A) - 

Furthermore gOj = ^ g^ = ^ . 

Example 2.6 . Let X a = U al , ... ,X ) ■, a=l, 

(N > p) be a random sample of size N from a p-variate normal 
distribution with mean jj = (y^, ..., y ) 1 and positive definite 
covariance matrix E. Let x a =(x ..,...,x )', a = 1,...,N, 

0i 1 OiP 

denote the corresponding sample values. Write 


N 

x“. 

N 




E 

a = l 

s = E 
a =1 

(x a - X) 

(X a - 

x) • , 

N 

x“. 

N 




E 

S = E 

(x a - X) 

(x a - 

X) • 

a=l 


a=l 





The sample space x. is the space of all sample values and the 

parametric space fi is the space of all values of 9 = (y,£) . For 

testing H Q : _y = 0^ against the alternatives H : jj ^ we get 

Jl, = {(0,E), I is positive definite}.. 

0 

= {(y»E), y^O , E is positive definite } . 

1 

The group of transformations g which leaves the problem 
invariant is the full linear group G^p) of nonsingular matrices 

g which transforms, each X a to gX a . The induced group G is 

the group of transformations g satisfying g6= (gjj, g £ g') . 

As for any A e A , (x 1 ,. ..,x N ) e g _1 A implies that (gx 1 ,...,gx N ) 

e A, we have with v = gx , a= 1,...,N , 

P n (g _1 A) 


= tj. <2 "> !{NP 1 1 1 _!SN expf-ijtr I -1 £ (x a - jj) (x“ - u) ■} f dx “ 

5 A a=l 0=1 “ 

= / ( 2 tt) | g2g. 1 expt-iitrfgZg ')' 1 Z (i“-gja) (^“-gja) •} n d£° 


a = l 


= Pge (A) • 

The fact that = ^ 


is evident. 
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We are now concerned with the nr„M „ v 

e P rot) lem of characterizing the 

invariant teats. To do that we need the following definition. 

ltea MUnjJBtta Uwrl.nt functions! , 

* function T (x) defined on x is invariant under the group of 

transformations g operating from the left on V onto itself if 
T(x) = T(gx) for all x e x and for all g e G . T(x) ls called a 
maximal invariant under G if it i s invariant under G, and if 

T(x) = T(y) for x,y belonging to x then there exists a g e G suet 
that y = gx. 


To investigate the behaviour of the maximal invariant with 
respect to the group g in the sample space, let for x e x 

0 (x) = {gx: g e G>. 

■' 

. 

0 (x) is called th e orbit of x with respect to G. A function 
T(x) is a maximal invariant with respect to the group G if and 
only if it is constant on each orbit and it takes different 
values on different orbits. All maximal invariants are equivalent 
in the sense that their sets of constancy coincide. 

Let <Kx), x e x be a statistical test of H Q against H , that 
Ik is, <Mx) is the probability of rejecting H Q when x is' 

N Jobserved. For a non-randomized test <J,(x) takes values 0 or 1. 
Suppose <t>(x) is invariant under a group G of transformations g 
operating from the left on x. The following theorem gives a 
useful characterization of c}>(x) in terms of the maximal 
invariant T (x) . 


1 — eorem _ 2 1 16 . A statistical test <j)(x) is invariant under G if 
and only if there exists a function h such that <j>(x) = h(T(xl) . 

Prgof. Obviously, if (x) = h(T(x)) then for x £ x ,g e G. 

^(gx) = h(T(gx)) = h (T (x) ) = <j)(x). 

Conversely if c|)(x) is invariant under G and T(x) = T (y) for x 

* 

y e x, there exists a g e G such that y = gx and therefore 
| <f>(x) = (J)(y) . 

Q.E.D. 



In general h may not be a Borel measurable function. 
However if the range of T is Euclidean and T is Borel measur- 
able then h is a Borel measureabie function (Blackwell, 1956) . 
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The induced group G operates from the left on & onto itself ( 
We can thus, similarly, define a maximal invariant v(0) on ft with 
respect to the induced group G. The following theorem asserts 
that the principle of invariance identifies all parameter points 
that are equivalent with respect to G. 


Theorem 2 t 17 . The distribution of the maximal invariant T(X) 

with vs lues in the space y where X is a random variable with values * 
in x depends on ft only through v(0) . 


in ft. Since 


EC 22 £. Suppose that v^) = v(e ) for 6 , e i. 

v<0 ) is a maximal invariant on SI under G, there^xists a g £ q 

B £ R h 96l ‘ * 8 ^ the °- al 9 ebr£i in Y. Then for 

B G B , by (2.31) , 


P 9l {T(X) e B} = P Q (T (gX) e B} 


= P ge e B) 


= P 0 {T(X) e B} 


Example 2^7. In Example 2 4 the Q ' E ' D ' 

sample space is |t| where ' ^^Wal invariant in the 


t = ^x/y-i_ E(v5) 2 


snd the corresponding maximal invaria . . 

IM where X= p /„ m , lant th e paramstr- 

, ' * T has the ^ . ®^^^-ric snace 

a u d d US Pr ° babllit v e denl it d y St f r u ibUti0n with - 
V^.9), Under h r\ ™ function f /4- | \\ 

■-*■»« w r„rr » »* 

f lTl't|X) = £. (th , . ... en b V' for t > o 


|T| 1 I 1 ) f T (t I X) + f T t_ t | nj 
Thus 


f]*|fcl*> 2 > 4 

ex P {-4X 2 } j u )iJr ( 1 


__J + l)) 

^ ^ (n+i) j 





, j 

... J w - •*'-a in+i) -z _ t J 

ich is an increasing functi (n+t 2 )^j 

fUndaraenta l the uni! ° r t >V0. 

un iforn>iy 


(2.4 


most nr, 031 " 9 Neynan 
Werf ul invari 
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test of H q against the alternatives H^:y ^ ^ rejects 
whenever |t| ^.C, where the constant C depends on the size a 
of the test. 


Note . The uniformly most powerful invariant test of size a 
is the uniformly most powerful test among all size a invariant 
tests. 


Example 2.8 . In Example 2.5 the maximal invariant in the 
sample space is |t| where 


2 2 

1 i _ _ ms + ns _ , 

t = + j? *(*-y) /(——~ g ) h 

m n m+n-2 


and the corresponding maximal invariant in the parametric space 
is j A | where 


X =( S + 7 a - 

T has the Student-t distribution with m + n-2 degrees of freedom. 
Under H Q , ,A= 0 and under H | A | > 0 . Thus from the above 
example we conclude that for testing H Q against H 1 the 
Student-t test which rejects H Q whenever ^ 1 1 1 > C, the 
constant c depending on the size a of the test, is uniformly 
most powerful invariant. 


Example 2.9 . Consider Exapple 2.6. Let 
T(x) = N(N-l) x> s" 1 x . 

Since for g e G^tp), the transformation x a -»- gx a , a = 1, .. ,u 
induces in the space of ( X,S) a transformation which transforms 

(X>S) -v (gx, gSg') , 


T(gx) N(N-l) (gx) ' (gsg') 1 gx 

= N(N-l) x'g'(g') -1 s"' 1 g _1 gx 

= N(N-l) x 1 s" 1 x 

/ ““ ^ 

= T(x) . 


Thus T(x) is invariant under 
in to itself. To see that it is 
for x, y e x 


G £ (p) acting from the left on x 
a maximal invariant let us suppose 
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T(x) = N(N-l) x' s ' 1 X, T(y) = N(N-l) s ” 1 y , 


T(x) = T(y) , 

where y = (y 1 .y N ), 


1 ? 

N £ i 

a=l 


a 


N 


O — 

1 " s 

a=l 


( Z a -£) ( x a -y) 1 


Since N > p, s and S.^ are both positive definite with 
probability one and hence there exist nonsingular matrices B and 
—i such that s = BB', S;L = B B Thus 


(B _ 1 X) • (|x) = (B" 1 ^) 1 ( B ~ 1 £) . 


This implies that there exists an orthogonal matrix 0 such that 


B 2S - 9. Bj 1 X. or equivalently y - 0 _ B 1 x 

Writing g = B 6B we get £ = g x which implies that 
_ .a , ... 


a _ a 

Z - gx , a - 1, ..., N. Obviously g e G (p) . Thus T(x) is a 


maximal invariant under G (p) . 


g £ (p) is 


Similarly a maximal invariant on a under the induced group 


v(e) - 1 %' e 1 _y_ = < 5 ^ (say) . 


T(x) is distributed as Hotelling's T 2 and its probability density 


function is given in Theorem 2.14. Under H Q , 6 2 = 0 and under 
H 1 # 6 > 0. The ratio of the probability density function of T 2 

when 5 2 = 6 2 (fixed) to that when 6 2 = 0 is given by 


L&P-> / i f 2y “ (6 2 /2) j r(H N+.j) , t 2 x l 

r(W exp { * 6 i } jir(isp+j) N+i+t 2 ^ 


where t is a value -of T . For fixed N and p and given 6 : 
this is a strictly increasing function of t 2 /(N-l+t 2 ) and hence 
of t . By using Neyman and Pearson's lemma we conclude that the 
test which rejects H q whenever t 2 > c, the constant C dependin* 



on the size of the test, is uniformly most powerful invariant for 
testing H q against H^. 

In the search for the optimum invariant test it is then 
necessary to find a maximal invariant statistic and its distri¬ 
bution. Sometimes it is convenient to obtain a maximal invariant 
statistic through a number of steps each corresponding to a 
subgroup of the original group G. The question is under what 
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ditions (if any) the maximal invariant under G and the 
ximal invariant obtained in steps corresponding to different 
subgroups of e the same. ...For simplicity let us assume that 

jiG is ^nerated by two subgroups H and K (it is the smallest 
r group containing H and K) . i t ls left to the reaaer to ver if y 

that G consists of the totality of products n 

. ■ __i h . k . , m = 

.. where h i e H. k e K, i = i. m . 1 1 


ftie orem 2,18 . Let y f (x) , x e x be a maximal invariant on x 
with respect to the subgroup H operating on x from the left 
sU ch that for any k e K, f (x^ = f(x 2 ) with x^ e x 
implies that f (kx^) — f (kx^) . If Z = T (y) is a maximal 
invariant in the space of y with respect to the group ' K* of 
transformation k* defined by 

k*f (x) = f (kx) , 

then T(f(x)) is a maximal invariant with respect to G. 

Proof . Let 

« = k m h m ••• k l h l 


here hu e H, k^ e K, i = i. 


kjl^x)) 


/m. Then for x e x 


T (f (gx)) = T (f (k m h m 


= T(k f(h k h ... k h x) ) 
m m m-1 m-1 11 


T(f(k m-l h m-l k i h i x) 


= T(f (x)) . 

Thus T(f(x)) is invariant under 


Let x a/ x 2 e x and let T(f(x 1 )) = T(f(x 2 )). Writing y 
= f ( x j_) and y 2 = f(^ 2 ) we,get T(y 1 ) = T(y 2 ). Since T (y) is a 
maximal invariant with respect to K*. There exists k* e K* 
such that y 2 =k*y 1 . Thus f (x 2 ) = k*f (x^ = f (kx^ . Since f 
is a maximal invariant with respect to H, (2.4 3) implies that 
exists an h e H stich that x 2 = hkx^. Obviously hk is 
element of G. Hence the theorem. 


Q.E.D. 
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2.4.1. Almost In-variance and Invariance : To study the relative 
importance of different test criteria we need to compare their 
power functions. Thus it is of our interest to study the 
implication of invariance of power functions of tests rather than 
of the tests themselves. Since the power functions of invariant 
tests depend only on the maximal invariant in the parametric 
space, invariant tests have invariant power functions. The 
converse of this fact, namely, if the power function of a test <f> 
is invariant under the induced group G, that is, 

E e <t>(g _ 1 x) = E ge <HX), 

then the test <(> is invariant under G, does not always hold 
good. To examine it further, let us first define the concepts of 
equivalence to an invariant test and the almost invariant test. 

Definition 2.3 . A test 9 (x ) 1 is equivalent to an invariant test 
cp (x) , x e. x with respect to the group G of transformations if 
<j>(x) = 'J'(x) for all x E x-N where P 0 {N> = 0 for a11 0 e fi . 

Definition 2.4 . A test <Hx) is said to be almost invariant with 
resoect to the group G of transformations if for g e G, <J>(x) 

= 4>(gx) for all x e x-N 'where P^{N g } = 0 for all 0 e fi- 

Thporem 2.19 . If a test 9 is equivalent to an invariant test 9 
then it is almost invariant. Conversely, if G is countable 
then any almost invariant test 9 is equivalent to an invariant 

test. 

Proof . Take 
N = N U (g - 1 N) . 

g 

Now x s x-N implies that x e x-N and gx e x-N. So for 
X e x-N , >(x) = (t>(x) = <Mgx) = t(gx) . Since P e {g _ 1 N} =P- 0 {N) 

= 0- VV = °’ 

Conversely let the group. G be countable. Write N = u 

gzG 

Since if* is almost invariant, for g e ^(x) 

* ty(gx) for all x e x-N with P fl {N } ■=* 0. Thus P Q {N} " 0 

g 0 g 0 

and ij; (x) = (J;(g x ) for x e Now define <t>(x) such that 





p 




fix) 



x e H 




if 

if x E >1-0. 
^viously* *(x) is an invari ant 


BAs 


C ST AT 



> .yagj 
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invariant test 
If the group g 


fun ctio n 


rj 


and i),( x) 


18 e q“ival ent to 


IS 


s 

op 


gener al. we shall now does « £ d in 

converse holds. “"dec which the 

L6t / ^ ! hS Sample 8 ^a and i et * „ 
iUbsets of x. Suppose that G - i s a * be the o-field of 

.erating from the left o n * onfn * 9r ° Up of transformations 

o-field B. Let for any a e A the th rrespondin, 3 

gx e A belongs to A x B . SupposI 8 "* ■ th * t 

measure V {that is for B^b., . in B with ub - < ’' £lnite 

£or all i) on G such that ii(b) = q imcl 1 ° and M(B i* < “ 

*° 0 lm P!ies that u(Bg) =o for all 

o e G. Then any almost invariant £u nt -n„„ 

* function on x with respect to G 

is equivalent to an invariant function with respect to Q. For a 

proof of this result the reader is referred to Lehmann (1959), p.225. 
requirement is satisfied in particular when 


The 


jjf^P(Bg) = M(B)» B e 8, g e G. (2.43) 

in other words y is a o-finite right invariant measure. Such 
right invariant measures exist for a large number of groups. 

N * 

Exajnple 2,10 . Let G = E (Euclidean N-space) with addition 
as the group operation. The Lebesgue measure u in the space of 
E N is a right invariant measure. Since G is abelian, the right 

also left invariant, that is, y(gB) = y(B). 

be the positive half of the real line 
operation. The right invariant 


invariant measure is 

Example 2.11 . Let G 
with multiplication as the group 
measure is given by 


y(B) = / 


d£ 


B 


the 


Example 2.12 . Let 


V 


multiplicative group of 


iand let dg = n dg^ 

i/j 



Since 


full linear group, that is, 

' 1 ij i 

of the transformation 


G be the - _ . % 

„ X N nonsingular natrices g - <«i, 

the Jacobian 
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g gh for g,h in 
on G is given by 


is 


m n . 


the right invariant measure 


V(B) = / 


_dcL 


B g 


N 


i i 

Example 2.13. Let G be the group of affine transformations of 
the real line R onto itself, that is, g e G has the form 
<a,b) ' a * 0 such tha * <a,b)x =ax + b. The group operation is 

y l g 2 “ ' a 2 ,b 2 in G ' b Y q \ q 7 = 

a l a 2' a l b 2 + b l^ * Tbe Jacobian of the transformation g g g 

a _ . Thf* T~ i nVlf 4 ..r _ ^ 12 


resp< 


U(B; = / 

B 


The right invariant measure 
da db 


P on G is 


"Lrairr^ TQeaSUre iS nCt equal t0 the right *"»*!«* 

measure and is given by 
da db 


V(B) = ; 

B 


■« ”:: z tv * m ~ "•“•”-“"”1 


The 


g l g 2 ( c 1 C 0# C,b 
where 


+ *i> 


12 ' 1-2 

given by 1 " <Pl ' &l) ' ® 2 = <C 2'-2* ‘ The ri 9 ht invariant measure j$ 



U(B)= /’ 
B 


dC db 

iN 


, h « tB . plobl „ >f 


against the alternatives 
disjoint subsets of 


H 


mg H • 0 e 


0£ 




\ ' Where fi H a "d n ar ° 
« o n_ 


Ml. 

\ 



-i n ""u cu 

sati sfying 'i, u o ° 1 

invariant under the group G of «. ° ^ 1 °* reroains 

from the left or. of transfor ™ation n 

eft on * and that (2 431 v , . 9 operating 

umformly mos t powerful , " ' h ds ‘ T1 >en if * ls 

then it i s al .„ . ^ mong the class of all < *° 

almost invar . ° rmlY "»st powerful amona riBnt tests 

iant tests for this —v, . 9 the class of all 




Problem. 


v'* ' ■' ■ • 

.. 

Ess?? * • 
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I£ * 15 al,K>St by (2.44), it is equivalent 

to an invariant test $ . The tests * and „ have the same p£)Mer 

function. Hence i s uniformly at least as powerful as 41 . 

m. Q.E.D. 

W It is now evident that any almost invariant test with 

respect to a group G of transformations has an invariant power 
function with respect to the induced group G on fl. The converse 
of this does not always hold. However, the converse holds in cases 
where before the application of invariance, the problem can be 
reduced to one based on sufficient statistics in the sample space 
whose distributions constitute a boundedly complete family. Thus 
in such cases with the assumption ( 2.4 3 ) invariant tests can be 
compared with their power functions. We shall use the term 
sufficient statistic to mean the minimal sufficient statistic. 

Let T be a sufficient statistic for {P Q , 0 e and let the 
family of distributions {P*,9 e ft} of T be boundedly complete, 
that is, for bounded function f(T) of T if 

E 0 f(T) \° (2.44) 

identically for 6 e then f(T) =0 almost everywhere with respect 
* to the probability measure P 0 . Let ii»(T) be any invariant test 
ppirith respect to the group G of transformations operating from 
the left on the space of the sufficient statistic T as 

gT(x) = T(gx) . 

Then for g e G 

E 0 <MT) = E 0 ip (gT) = E- 0 H»(T) 

so that <p (T) has* an invariant power function. Conversely if , 

V (t) =E ge^ (T) ' 

then for g e G 


V(T) = E 6 *(gT) . (2.45) 

Hence 

V'MgT) - ^ (T)) = 0 

identically for all 0 e ft. Since the distribution of T is 
v^boundedly complete, (2.46) implies that \p (T) = i|;(gT) almost 
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everywhere with respect to the probability measure PgT. 

For any test <J>(x) defined on the original sample space x 
Y(t) = E 0 (<J>(X) |T = t ) 

is also a test with the same power function as that of <j)(x) . Thus 
we conclude that if there exists a uniformly most powerful almost 
invariant test among all tests based on the sufficient statistic 
T then that test is uniformly most powerful among all tests based 
on the original observations whose power functions depend on the 
maximal invariant in,£he parametric space. 


Ex ample 2.1 5. Consider Example 2.4. Let us first show that 

..._ 9. o 


the sufficient statistic <x,S 2 ) for (y,a 2 ) is boundedly 


complete. _From Example 2 .1, x,S 2 are independently distributed 
an nS /a is distributed as the central chisquare with n -1 

nfirtroAp* .c___ 


degrees^ freedom. Thus the joint probability density function 
OI is given by 


f X,S 2 = k(o 2 )*^ n e xp {- ^(ns 2 + n(x + w) 2 )} (ns 2 )^( n -3> 

2c ' 


where 


h ~ Vn/2ir T lUlr\-“\W v 

For an Y bounded function f(x s 2 } 

P / £ /Tv n »v t _ O t • 


2 - -^ \ A ,0 ) 

(f(X,S )) =k / f(x,s 2 ) (a 2 )~ n/2 exp { 1 , 2 - 2i 9 

P U _2 (ns +n(x-y) 2 } d x ds 2 


Write 

72 = 1-26 and (1-26) ~ 1 t = y. 


2o‘ 


o 

The a 


E(f(X,S 2 )) = k / f(x,s 2 ) ( 1 - 28 )* 5 n (ns 2 ) (n- 3 ) 

.2 




If Ef(X ' s2 )=0 identically f or all 6 .. 

k ' f(5 ' ns2 + »5 2 - nx 2 ) (ns 2 ) ! 5(n-3) ’ ““ fr ° m t2>46) 

^ <-Mns 2 W, + 8 (ns 2 +nx 2 ) + n tx) * ,2 

identically ln 9 = „ 

8 end t. Treating this as sv, 

the Laplace tra 







' y? 


of 


... . 
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^ 2—2 2 
f (x,ns +nx -nx ) k (ns 2 ) ** ^ n ~ 3) 


exp {-i$ (ns 2 +nx 2 ) } 


with respect to variables nx 2-2 

W- 0 except for a set of v , ' ^ +n * ' , W ® oonclude that f(X,s 2 > 

values of i y cA . 

measure 9. So the aistnw ' , Wlth Probability 

distribution of (y <z 2 \ , , „ 

S ) is boundedly complete. 

Suppose that for testing « 

H-sU * 0, the power of the t ° 8 - ~2 against alternatives 
Then for g e G (note * U ' S > only on X=^ y/o. 




(X, S 2 ) ) = E 


2 2 ('MX / S 2 )) 
ay,a or 


So 


~ E a 2 s 2 )) 

V, o 


E 


2 C<MX,S 2 ) - i(i(ax..a 2 S' 2 ) ] = o 


y, a 


r, _ 2-2* 


identically in (y,o 2 ). This implies that 
<Mx,S 2 ) = ip (ax, a 2 S 2 ) 

j a ^" mOSt ever ^ rw l lere - Thus from Example 2.7 we conclude that of all 
jltests of y = 0 with powers depending on X , the Student t-test 
which reject H q whenever |t| >_ C is uniformly most powerful 
invariant. 


Example—2. 16 . Consider Example 2.6. Let us first establish 
that the statistic (X,S) is sufficient for (y,E). From Theorem 
2.12 the joint probability density function of x,S is given 
by 

C | E | | s | ^ (N-p- 2 > exp £ _^ tr j;" 1 ( s+N (x-y) (x-y) ') }, 

where 

C~ 1 = 2* 4Np x p(p+1)/4 l r(4(N-i) ' 

i = l 

For any real valued function f(X,S) of (X,S) 

E(f(X,S)) — C S f (x,s) [ | E | -!aN |s| J5(N_P " 2) ] 

* 

x exp {-!jtr(I -1 s + N(x-jj) 1 Z _1 (x-^))}dx ds 


(2.47) 
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P 

where dx = n dx., ds = n ds . . 

i=l 1 ij 

Write 

1 1 = I - 20 , v = r (1-26) - 1 a , 

e I is the P x p identity matrix and 9 is a real symmetric 
matrix, if E(f(x,S)) = o identically for all y,i, then 
from (2.47) we get 

C / f(x,s) | i _ 2 e |^ N | s (N-p-2) 

exp ^~^tr ( (I-26) (s+Nx x') - 2Na- + Na‘(l_26) _1 a ) } d x ds 

= 0 . 

or equivalently, 

C / f(x, s+Nx x' - Nx x') lsl^vN-p-2) 

i 

* eXp 'f-^tr (s+Nx x') + tr6 (s+Nx x') + Na'x} dx ds 

= 0 , 

(2.48) 

identically for all 6 and a. Identifying (2.48) as the Laplace 
transform of 


Cf(x, s+Nx x* -Nxx') |sT 2(N P~ 2) exp{-J$tr (s+Nx x') } 

with respect to variables Nx/ s+Nx x', we conclude from (2.48) 
that f(x,s) = 0, except possibly for a set of values of (X,S) 
with probability measure 0. In other words, (x,S) is a "" 
bound&dly complete sufficient statistic. Hence, if any test 

<MX,S) has invariant power function with respect to the induced 
group (p), then 

iKx,s) = <Mgx, gsg') 


for g e G t <p> almost everywhere, that is, * is almost invariant 
with respect (p) . By Example 2.12 we conclude that of all 


tests of H q s jj - 0 against alternatives H : y ^ 
function depending on N jj* I' 1 jj the Hotelling's 



powerful. 


0 with power 
T^-test is 
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2.4.2. Sufficiency and In vari^. T . . -, ' 

- -‘ J-t is well known that some 

simplification is introduced in a testing problem by characteriz¬ 
ing statistical tests as functions of sufficient statistics and 
thus reducing the dimension of the sample space to that of the 
space o he sufficient statistic without shrinking the parametric 
space. the other hand invariance by reducing the dimension of 
the sample space to that of the space of the maximal invariant 
shrinks also the parametric space. Naturally we now ask the 
following question: l s it possible to use both sufficiency and 
invariance simultaneously and if so in what order? Under certain 
conditions the reduction of the sample space can be done by using 
both sufficiency and invariance and the order is immaterial in 
such cases. The reader is referred to Hall, Wijsman and Ghosh 
(1965) for these conditions and some related results. However, 
one can avoid the task of verifying these conditions by replacing 
the sample space by the space of the sufficient statistic before 
looking for the group of transformation which leaves the problem 
invariant and then look for the group of transformations on the 
space of the sufficient statistic that leaves the problem 
invariant. 


[2-4.3. Invariance an d Unbiasedness : In testing statistical 
hypotheses the principle of unbiasedness plays an important role 
in deriving a suitable test statistic in complex situations 
involving composite hypotheses. 

A size a test <j>(x) is said to be unbiased for testing 
H Q : 6 £ ^ against alternatives 0 e if E Q <j>(x) a a 

for 0 e . 

In many complex testing problems the principle of unbiased¬ 
ness and the principle of invariance seem to complement each 
other in the sense that each is successful in cases where the 
other is not. However, as we will demonstrate below, if both 
principles can be applied successfully they lead to the same 
(almost everywhere) optimum tests. 

Consider the problem of testing H q : 0 e against 

o 

alternatives ; 0 e ... Let us assume that i£ is invariant 
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under the group G of transformation g and let C be the 

Qfc 

class of all size a unbiased tests (0 <. a < 1) . For any test 

(x) , x e x let us define the test <j>g by $g(x) =<f> (gx) , g e g, 

x e x. Obviously <j) e C if and only if <J>g e C . Hence if <j>* 

a ot 

the unique (upto a measure 0) uniformly most powerful unbiased 
test for this problem then 

V<t>*g(X)) = Eg (<(.* (g X ) ) 

= E- e (4>*(X)) 

= sup E- ($(x) ) 

<f>eC g6 
Y a 

= sup E (4> (gX) ) 

<t>geC a 

= sup E (<)>g (X) ) 

= E 0 (|<|>*(X) ) . 

Thus <t>* and 4>*g have the same power, function. From the assump¬ 
tion of uniqueness <J>* is almost invariant. It may be remarked 
that if the sufficient statistic is complete, the uniformly most 
powerful unbiased test in terms of the statistic is unique. 
Therefore if there exists a uniformly most powerful almost 
invariant test <J>** we have 



(X) > E 0 <j>*(X) ,00^ 


(2.49) 


Comparing <t>** with the trivial unbiased test . <j> (x) = a for all 
x e x we conclude that 4>** is unbiased and hence for 0 e ft 


E 0 ♦**(*» ^ E 0 d>*(x) . {2 . 5 0) | 

From (2.49) and (2.50) it follows that and <j>** have the j 

same power function.' From the uniqueness of <J>* we conclude that 
<t>* = <f>** almost everywhere.. 

Ex_ample 2.17. Let Xy —, X N be a random sample of size 
N from a normal population with mean y and variance o 2 . We are 
interested in finding the uniformly most powerful unbiased test 
of H : o =0 against H. s o 2 * a* where o 2 is five*. Wri* 


of H : o 2 = o 2 against H,: o 2 ± 0 2 
• N ° 1 o 

S 2 = l (X.-X) 2 . 

i=l 1 


where 


is fixed. 
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The size a test which rejects H q whenever s 2 /„ 2 s c, 
or sW a * C 2 Where the Constants Cj.Cj ere given by 


-X. 


X 


/ 2 £ 2 (s 2 l^)ds 2 = -ij , 2 s 2 f 2 (s 2 |o 2 ,ds 2 - l . . 

'*> ^ S s ° 

with 

£ (s 2 |cr 2 ) = 2- !s(N - 1) < 0 2 )-*<»-« e *p{->i lljt.aj'sW-M (2 . 51) 

s a 2 

is uniformly most powerful unbiased for testing H q against H 1 

This problem remains invariant under the group G of 
translations, that is/ X ± -► x ± + a, i = i,...,n. The 
maximal invariant in the sample space of (X^,...,x^) is S‘ 

T?rnm l 0 ^ 1 ^ 1 t* f nl 1 rtuo +-Vm+- vk/n • r_i ... . _ 


J.N 

From (2.51) it follows that no uniformly most powerful invariant 
test exists for this problem. 




Example 2.18 . Consider Example 2.7. The symmetric student 
t-test, which is uniformly most powerful invariant, is also 
uniformly most powerful unbiased for this problem. 

, Example 2.19 . Let (X^, ...,X m ) and (Y^, ...,Y n ) be random samples 
^gFpErom two normal populations with means and variances a 2 , a 2 

respectively and let 


_ 1 m 1 n 

X = — E X. , Y = - E Y . , 

m i=l 1 n i=l 1 


9 ^ 9 o i n ~ 

S = E (X -X) 2 , S 2 = — E ( Y.-Y) 2 

1 i=l 1 2 n i=1 i 

, 2^2 


. 2 2 

The problem of testing H q : a ^/ a \ ~ ^ a< 3 a inst alternatives 
2 2 

a 2 ^°i > k ' where k is fixed, remains invariant against the 
group of transformations 


X. 

l 


aXf + b^, i = 1, ...,m. 


^ maximal invariant in the sample space is 
s 2 

7. - m-1 s 2 

n -1 9 

S 1 
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2 2 

From Theorem 2.2, with <5= o~/ a . 


£„ (2 | 6 ) = £4 («)' 5(ra ' 3) 

l m— l 

r()s(n+r,-2)) ( ^T z)!S(n ’ 3> 

r (&s (m-a)) r(*s(n-D) (6+ Z )*s(m+n 

/ 

for z 5 0. Thus the distribution of Z depends only on 6 and 
for 6 < 6' f (z|6), f z (z | 6 ') are distinct and [ f g (z | 6') /f z (z 16) ] 
is a nondecreasing function of z. So the test which rejects H q 
whenever z > C, the constant C depending on the size a of the 
test, is uniformly most powerful invariant. It is also uniformly 
most powerful unbiased for testing H q against H^. 


Example 2.20 . Consider Example 2.?. For testing H q 
against H , Hotelling's T^ test is uniformly most powerful 

S ^ 

invariant. Whether it is also uniformly most powerful unbiased 
or-not is still an open problem. 


2.4.4. Invariance and Optimum Tests ; Apart from the fact that 
the performance of an invariant test is independent of the 
nuisance parameters, powerful support for Invariance comes from 
the famous unpublished work of Hunt and Stein towards the end of 
the Second World War which asserts that under certain conditions 
on the group G of transformations leaving the testing problem 
invariant there exists an invariant test which is minimax among 
all size a tests. 

It is well known that given any test function on the sample 

space we can always replace it by a test depending only on the 

sufficient statistic such that both have the sane power function 

Such a result is too strong to expect from the maximal invariant 

statistic, on the sample space. The appropriate weakening of the 

property and the condition under which it holds constitute the 

Hunt-Stein Theorem which states that if the problem of testing 

H o : ® e against H : 0 e Q remains invariant under the 

o l 

group G of transformations then given any test function <J> 
on the sample space, there exists an invariant test \|> such that 
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i inf 


Be* s 

sup 

0efi 

H 

o 

E e* 

V s 

inf. 

0efi 

E e* 


H 


(2.52) 


in other words y performs at least as well as 

cases. 


<J) in worst possible 


IheoremJLai. Let B be . o-field of subsets of G 

such that for any A e A (a c-field of subsets „ f the sample 

space x ) the set of pairs (x,g) with gx s A is in A x Band for 
any B e B and g s G the set Bg is in 8 . Suppose that there 
exists a sequence of probability distributions v n over <G,8) 
which is asymptotically right invariant, that is ” 


lim|v (Bg) - v n (B) | =0. (2.53) 

n-*“ 

Then given any test $ there exists an almost invariant test ip 
with respect to G satisfying (2.5). 

We refer to Lehmann (19 59) Chapter 8 for the proof and 
other discussions in this context. Giri, Kiefer and Stein(1963) 
and Kiefer (1966) are also referred to in this context. Thus, if 
for a given problem which is invariant with respect to a group G 
of transformation, Hunt-Stein condition is satisfied, then the 
uniformly most powerful invariant test will automatically be 
minimax, that is, will maximize the minimum power over . 

The group of translations of the real line, the group G in 
Example 2.4, the multiplicative group of orthogonal matrices and 
the multiplicative group of nonsingular lower triangular matrices 
are examples of sorre groups which satisfy the condition of the 
Hunt-Stein theorem. The full linear group G^(p) of nonsingular 
matrices does not satisfy the Hunt—Stein condition for p > 2. 

Thus the symmetric Student t—test of Example 2.4 is minimax 
for testing H Q :y = 0 against the alternatives H^y ^ 0. 

The Hunt-Stein theorem has been successfully used by Giri, 
Kiefer and Stein (1963), Giri and Kiefer(1964) , Linnik, Pliss 
and Salaevskii (1966) and Salaevskii (1968) to prove the long time 
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open problem of the minimax property of the Hotelling's T test. 

It may be remarked here that the conditions of the Hunt-Stei^ 
theorem, whether algebraic or topological, are almost entirely on 
the group and are non-statistical in nature. For verifying the 
admissibility of statistical tests through invariance the situa¬ 
tion is more complicated. Aside from the trivial case of compact_ 
groups only the one-dimensional translation parameter case has 
been studied by Lehmann and Stein(1953) . If G is finite or a 
compact group the most powerful invariant test is admissible. For 
the other groups the statistical structure plays an important role. 
The admissibility of Hotelling's T 2 test (and hence of the 
symmetric Student t-test) has been proved by Stein(1956) and 
Kiefer and Schwartz(1965) by using different methods. 


2 * 4 * 5 * Invaria nce and the Most Stringent Tests ; Consider the 

problem of testing H Q : 6 e against the alternatives 

o 

v 6 e n Hi where n a ^d n H are disjoint subsets of a 

With % u fi = 0. Let Q a denote the class of all level a tests 

of H q and let 

e*<e) = sup e e e n„ . 

* eQ a "l 




power thL C «ri th l enVel0pe P ° Wer fUnCti ° n 3nd ^ - the reximu. 

can be obtained at level a against the alternative 8 

Definition 2.5. A test $ that minimizes 

(B*(8) - E e (<|>)) 


sup 

9e0 


’H 


is called the most stringent. I n other worn 

minimizes the maximum shortcomings S 3 '"° St strin 9 ent 

If the testing problem is lnvari= _ 
transitions satisfying the cond ^ the ««>«• G of 

theorem and there exists a uniforl “ ° f the “^-stein 
invariant test ** „ ith respect Y Powerful almost 

** 3n eXample we ci te the symmetri ** is "»« stringe. 

^ "ost stringent. The '£%?*'** ^t. which i 1 

lri# Kiefer and Stein(1963) 
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For further details and supplementary reading in this context the 
it* reader is referred to Lehmann (1959, 1959a), Kiefer (1958) , Giri 
rt and Kiefer (1964a) . 


X. 


O* . Example 2.21 . Let ..., A 

'fl dis tributed random variables with 


. »X be independently normally 


an d common variance o satisfying’ y 
We are interested in testing the hypothesis H : 


means E(x i )=y i , i = 1, ...,N 


s+1 


= y N = 0, s < N. 


o' P 1 


= y. 


or 

‘U 


= 0, r < s. This problem remains invariant under three groups 
0 f transformations 


X i * X i + ai , 
x i * X i 


i = r+1, ... ,s 
^ !»•••, r, s+1, ..., N, 


orthogonal transformations of X. 


» » • • / x r , 


X. 

1 


aX i , a ^ 0, i = 1,...,N. 


With respect to G^,G 2 and G^ the uniformly most powerful invariant 


test rejects H when 
o 


:s 


- 0) 


r 

Z 

i = l 


2/ 

\ /r 


N 2 

Z xf/(N-s) 


> C 


i=s 


—ill 


where the constant C depends on the size a of the test. Under 
H q , W has the central F-distribution with r and N-s degrees of 
freedom. The power of this test depends only on 


:est 


6 2 = 


v . 2 . 2 
Z p /a 

i = l 1 


X 


Since each of the groups GG 2 and G^ satisfies the condition 

_ j 

of Hunt-Stein theorem this test is minimax and also most 
stringent. 

Example 2.22. The following data were collected from an 
experiment on Jute Crop in Sagar-Bishnupur, West-Bengal, India. 
The weights in gr*ams of green jute plants and dry jute fibre were 
* r ecorded for twenty randomly selected individual plants for each 
two different agricultural varieties grown under similar 
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conditions. Let x L = ( x i a ' X 2o^ ' a_ * ’ 


., 20, y_ = (y ,y ) , 
The numerical values 


(x = 1, ...,20 , denote the sample values 
are given in Table 2.1. 


Table 2.1 

Variety 1 

Plant No. Weight in grams 


Variety 2 
Weight in grams 


Green Dry 
Plant Fibre 


Green 

Plant 


Dry 

Fibre 


971 


2 

892 

63 

3 

1125 

70 

4 

82 

6 

5 

931 

65 

6 

112 

9 

7 

162 

10 

8 

321 

12 

9 

315 

20 

10 

375 

30 

11 

462 

33 

12 

352 

27 

13 

30 5 

21 

14 

84 

5 

15 

229 « 

14 

16 

332 

27 

17 

185 

17 

18 

70 3 

53 

19 

872 

62 

20 

740 

65 


760 

872 

1098 

144 

900 

120 

180 

200 

; .214 

440 

460 

220 

260 

410 

90 

270 

1080 

888 

209 

700 


Assuite that these observations are samples from two tivariat* 
normal populations with means U = (up,) v = (v„vJ 

respectively and the same covariance matrix l7 
Let us consider the problem of testing 


h 2 = v 2 


(ii) 


li * v 


M i 

i&53a« " kTv - ' 


■ ■ '' ■ 










■ 



m. 

M 


vnr Variety 1 : 

x = (x r ^ 2 ^ ~ (477.5o, 33.85) 

% * 20 

; | s l= (s ii' = ' *> <* a - X 
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X) 1 _j 253568 1 , 149056.5 


?n r Variety 2 : 


1490 56 . 5 , 108 


) 


38.55 


l = (y^j) ' = (475.75, 36.85) 

9 20 

S 2 = £> ( *“ - X> 


I __ 


= ( 


2239083 


/ 142218.2 


s " S 1 ' “2 


+ s ( 4374764 f 291274.7 
291274.7, 20 189.10 } 


14 2218.2 , 9350.55^ 


For (i) . Studenc t statistic with ^ 
f given by h 38 degrees of is 

»^80 ( x 2 - y 2 ) 


/ 


S 22 + s 22 


= -0.1415803 . 


fir ce We V * - at 5 percent level of significance, 

lot Hotelling.s T 2 statistic with 38 degrees of freedom 


T 380 (x - y) • s 1 (x - %) = 0.0 24410 2. 

Hence at 5 percent level of significance we accept 

EXERCISE 2 


H o : ± = 1- 


!. Show that if a quadratic form is distributed as a noncentral 
chisquare, the noncentrality parameter is the value of the 
quadratic form when the variables are replaced by their 
expected values 

2. Show that if f (x|6 2 ) is the probability density function of 

A 

e noncentral chisquare or of a noncentral F random variable with 
the noncentrality parameter 6 2 , then 

|x (x |6[) 

a n increasing function of x for all < a 2 . 
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. P distribution with r.U degrees of 
* chnu t-hat if X has central F di.s 

3. Show tnat ir a , Rpta distribution with 

„ _ v _ rY /(rX+ N) has the central Beta ais rtti 

freedom then Y = sx/'* AT ^ t ? 

, i c<-Vio distribution of log F. 
J 5 r,J$N -degrees of freedom. Find also j 

, vr-=-n random variable X with N 

4. Consider a chisquare (central) ranaom 

of freedom. Show that 


“ =Jl' 6 =i' *-f 


is a good choice for the rapid approach of the transformed 
variable 


“ [<f> 6 + ^ - n 

to normality as N -*• « . 

5. Let X 1# ... # X N be independent and let each X^^ be normally 
distributed with mean p and variance a 2 . Show that if 

N x i 

a 2 X i^S-i ~2 then ?* ^-) 2 is distributed as 

11 °i 1 ~1 o ± i=l °i 

o — 


Vl l{2) where « 2 = 0 when all ^'s are equal. 

6. Let s be distributed as Wishart w (N-l n 
any non-null p-vector L = (» l)t p , , ' * Sh ° W that for 

distributed as X 2 ‘ . .V’- ±' S ^k is 


distributed as X 2 

N-1 


7. Giri (1965) . 


t z be a p-variate complex normal rando 
mean 4 and Hermitian positive de^init triable with 

Show that definite covariance natrix Z 


E '- 18 diStrib uted as X 2(2a* fK , 

(b) T.^ r,a P ~ 2 ) • 


(b) Let Z a a - , P * ‘ ® * 

r r ■ 


.-i“ . rat „ 

- # S = y ( CL _ 

a=1 21 (2 ~ ,Z) ( 7 <* - 

a~i * —' \£ - z) * 

cu... .. 


let 




tp> 


(i) Sh °w that S 5 . 

<U>Th6 Pr ° bab ^it; den s 1 it n y d6 r nd6nt - 

£ S<b) -KU|-nuu |s |N-p_i fUnctiQ h of s is 

e*p{ J|trE-l l} 


I 






where 


S' 


b as 



4 


K -1 „ ^p(p-l) p r 

1=1 '*.• 


c Sr ATisr 


CAL concepts 83 


J(iii) Show that T 2 - m, 

2 . -N c " 


X 2(«-D * WhSre U 


2P an< i X . ' distributed a 2 

(iv) Among all size *** inde Pendent S X 2p (2N ^* E ' 1 oi) 

..tirkr»T.r« v £ Of JJ 


/ 


with power functions depend ° £ V 2 - 0 agai 
test which rejects H ^ ^ ° n Nt “ r\ t ! 2 M 0 

- o whenever - the Hkelihood ratio 


t 2 * k 
c 


iy 


where the constant k depends on th 

sost powerful invariant. , ^ « <* the test is u nitormly 


ds 


8 . Suppose that x= e N „ 

< the Euclidean N- Spa ee . 

(a) Let G be the groun of +- , 

group of translation of ( x ,. „ , , . 

(x.+a.x.. + a > ■ 1 " X N 1 


for 


(X l +a . X N +a) ’• Show that 

(*1 " x) 1 t 


in x to 


where 


N 


X N 1 = 1 Xi 1S 3 103X11,01 inva riant on x under G. 


(b) Let G be the group of orthogonal transformations of x e x 
Show that x'x is a maximal invariant under G. 


9. Let X 1# ...,X N be a random sample form the normal population 
with mean u and variance o 2 and let G be the group scale of 


scale changes, that is, g e G, gX i = ax^ a >0. Show that the 

uniformly most powerful invariant test of H p i 0 against 

2 ® 
li > 0 when a is unknown rejects H q whenever 

x 


N 


* C 


1 " _ 2 

srr i <x -x> 

1 = 1 


, _ >.u. eize ct of the test. Show that 

where the constant C depends on the size a 

it is also uniformly most powerful unbiased. 

, _ sample from the normal distribution 

10. Let X x . *n be a ra ” show that among all tests of size « 




with mean u and variance 

2 


for H_:o^ * 1 against H,: o 


SHOW UIOS -- . . . _ 

2 < i, the size « test which rejects 


»o w henever 



- 


' 


ft 
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» 9 

E (x. - x) < k / 
i=l 1 


is uniformly most powerful invariant 

y be random samples from two nor^ 

11. Let V"V Yl ' n ^onectively and the common 
populations with means y^ V 2 
2 _ . 


variance o . Let 


1 = 1 X — J- 

'( + 7 ) (y - x) 

_ m n 

JJL 

V m+n-2 

show that the uniformly most powerful invariant test of 
H : u„ - u < 0 against H : y 0 - y n >0 rejects H whenever 

O 2 1 12 1 

t > C, the constant C depending on the size a of the test. 

Show that it is also uniformly most powerful unbiased. 

2 

12. Show that the distribution of Hotelling's T statistics 
under alternative hypotheses remains unchanged if y. is replace 
by (<5/ 0,.../ 0) 1 and I by identity where o 2 = N_y' Z~ ^ y. 

13. Consider Example 2.9. 

(a) Show that U = NX' (S+NXX') ~ 1 X = NX'S _1 X / (T+NX'S -1 X) . < 

(b) Show that the- uniformly most powerful invariant test of H 

against Hj, rejects H q whenever u £ C. Show that the ° 

probability density function of u under H q is given by 

f u (u) = ~[isp) N J'^(N-p) )■ ^ 1 a-^ , "- p| - 1 

* exp {-Ij6 2 } Cw-p), ij p ; !j U a 2 ) 

»»ere is the confluent hypergeonetric series given by 

<Ha,b,x) = 1 + — x + 3 (a+1) x 2 

b b (b+1) 21 + • • • • 
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r c of C‘ are vectors in ir 0 which are perpendicular to tt 

L- $ • • • t -■£ U) 

Clearly Y has the N-dimensional normal distribution with E (Y) = 

v _ ( v v ' ' = CA0 and covariance matrix a I and v satisfies 

jl ■ • • • n' 


the restriction v 


— ... — v — 0. Under H : v. ... — v — 0 


s+1 ”* 'N " " o 1 r 

Thus by means of an orthogonal transformation of the above form we 

reduce the general linear hypothesis in the canonical form which 


concerns independent random variables Y^, ...,Y having normal 


distributions with variance a and with irea 
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t 


i > s 


(3. 3] 


V 1 = 


= v =0 
r 


vj j. r 

Of l' 1 -' Let “ COnSider the geometrical interpretation 

of the above fact for the case N = 3 with 
/ \ 



e 


■ ( V 6 2> 


The rank of A is 2 ana y, = e ,, _ . 

subspace of E 3 where = V 2 V ^ - 6.,. 
subsnace . 1 ^2* Under H^, e „ = 0 


So i s a 


subspace of with ^ 1 is * 

r r ■** *■*is." ;;;r normaiizea 

s^rpendicular to the second row Hen *"* ( °'°' 1) and 

(VA ' 1//6, . 2//6) . SiaalarlvTh ! “ ±S 9ive " by 

Perpendicular to th* *■ ^ he third r °w of c , . 

er to the first and the . C ' whl ch i s 

mean vect °r v is COnd row is ( 1 // 2 , _ ( 



0) 


2__ 

^ ~ ■ 

^ometricaHy We r 3 

T! Sent th -- —e aa. 

hyi = 1... i 

fiance o 2 y dlstr ibuted 

j ana 1 a 74» v. 


fe* 


u r 

t= *.*• w;r*r ,( v 

“* ’ * - - , X 


;jiV --huu^-'H. v 

1^-,. yP thesis «o= U, = 


rar . j * he 

ndom variables with 
) = u lth common 

i# J = x * •..,N., 

- _ *i * 


11-, X 


is given by 


IN I ) *' 


• • 


= y. 


s = -I 

s -r = i. 



IJN I VAR I ATE 


GENERAL LINEAR HYPOTHESIS 89 




is spanned by the column vectors of A and n is spanned by, the 
single vector whose components are all equal to unity. 

This is called the one-way classification. The following 
numerical data were collected from an experiment with four 
different varieties of wheat at the Indian Agricultural Research 
Institute, New Delhi. We shall use the data for further 
discussion with one-way classification. 

3.1.1. Invariance of General Linear Hypothesis : The canonical 
form of the general linear hypothesis in terms of Y^, ,Y 
remains invariant under the group generated by the following 
3 subgroups: 



Group of translations G^, translating 
X + S Y (Y*.Y*) • 

with Y* = Y i + a^, i = r+1, ...,s, Y* *- Y^ otherwise. 

Group g 2 of orthogonal transformations of Y^ ...,Y r . 

Group g 3 of scale changes, changing Y to gY = aY with a ^ 0. 
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Table 3.1. Weight 


Plant No. 


Weight in grams per 10 ears for fourteen 
randomly selected plants for each of four 
different varieties of wheat grown under 
similar conditions. _ 

Varieties 

12 3 4 


1 

13.7 

2 

13.9 

3 

17.7 

4 ! 

17.3 

5 

13.8 

6 

15.7 

7 

16.3 

8 

13.1 

9 

13.6 

10 

13.€ 


The joint probability density function of Y^, ...,is 

(2ir) -i * N (o 2 ) exp [- £ (y^v ) 2 + £ y. 2 >] (3. 

2cr i=l 1 1 i=s+l x 

^ 2 

Thus (Y.,...,Y , £ Y ) is a sufficient statistic for 

1 s . _, i l 

2 l—s 1 , 

(v 1# ...,v ,o ) . So sufficiency reduces the problem to the 
1 s N 2 

consideration of (YY , Z Y. ) . In the space of this 

i=s + l 

sufficient statistic a maximal invariant under is 

N 2 

(Y ,...,Y , Z Y ), a maximal invariant under G-, G. is 
i=s+l 1 12 

r 2 N 2 

(U= I Y , V = Z Y. ) and a maximal invariant under 
i=l 1 i= s +l 1 

an< ^ ^3 is U/v. Thus the principle of invariance reduces 
the data to the single statistic U/V or equivalently to the 
statistic 
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*1 


Eacb ° f the 3 9rOUpS G i- 1 - l.a.5. Of transformations induces 
in the parametric space the following group of transformations 

G. . is the group of translations v * gv = v* = (v*. v *) • 

iith v, * = v. +a,. i = r +i ^ -- 1 N 


with V =v i +a i' 1 = r+1 .a; v.* = v , i . _ 1 r and 

V. * = 0 otherwise, which leaves (v * \r * 2 \ *.-l. i 

i vcs tv , ...,v *,o ) as the maximal 

invariant in the parametric space. 

G 


2 is the^group of orthogonal transformations of (v , ...,v ) 
which leaves v. / 0 2 ) as ^ ^ ^ 


parametric space under S and g 


2 * 


G 3 2 is the group^of scale change v - av, a * 0, which 
leaves 6 - v. /a as the maximal invariant in the parametric 

space under G^G^G^ The distribution of W depends only on 6 2 

arw^ Viae _^ _ - - 


and has the noncentral F distribution with r, N-s degrees of 
freedom. From Example 2.21 the uniformly most powerful invariant 

4 _ TT 


test rejects H :v. = 
o 1 


— v r - 0 whenever 


* M 

Vl 


0) > C 


% 


4 

- 


I 


(3.5) 

where the constant C depends on the size a of the test It is 
also uniformly most powerful among all tests with power functions 
depending only on 6 . If r = 1 the test given in (3.5) reduces 
to the symmetric Student ’s-t test and is uniformly most powerful 


i 


unbiased for testing against a,: v, / 0 . 0 n the other hand 


O “1* V 1 - -- 

no uniformly most powerful unbiased test exists if r > 1. 


3 * 1 ’ 2 * L east Squares and the Canonical Form a In actual practice 
it is .usually not very convenient to carry out explicitly the 
reduction of the general linear hypothesis to the canonical form. 
However, we can express the maximal invariant statistic W in terircs 

of original observable random variables x ± .by the theory 

°f least squares. Let * 


* 


S = - E (Y) ) ' _ E (Y)) 

2 


s 

= (y ± - v.) 

i=l 1 1 


N 

Z 


i=s + l 


f 


■ 






(3.6) 
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in 2 

Since 1 y- 2 is the minimum value of S z without any 

i=s+l 

restrictions on the values of v^, ...,v s an< ^ the orthogonal 
transformation % = ex leaves the distance unchanged, that is 

S 2 = - E (Y)) 1 (£ - E (Y)) 


= U - ji) ' (x - jj) 


N 


= 1 (x - y ) 

i = l 1 x 


N 


The minimum val ues of l (y - v ) z + E y y , under variations 

i=l i=s+l 1 

of v i # *‘*» v s is also the minimum value of (x - jj) 1 (x - jj) 


a ■ Thus if i = < Oy .C N )' is 


under variations of _y in tt 
the least squares estimate of jj under tt , then, 

tU w 

N 


i=s + i Yi = ( x -0) • (it - 5) 


= min(x -u) «(x-y) . 

TT ~- 


n 


Similarly under ^ the minimum value E y. 2 + " y . ; 

. i.= l 1 i=s + l 1 


of 


i . 1 y i" + . 2 (y i -v.) 2 + %. 2 

\ 1=r+1 1 1 iJ+i 1 

can be expressed as 


r 

’f 2 
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L yr + i v 2 


i=s+i 


~ (* - £) ' (x - £) , 


* i=l 1 

where y i s . 

r ' , 2 ~ 6 l6aSt Squares estimate of 

- h) ' (x - y) _ 


So 


"P <* - jj) ' <x-jO . 


U under tt , 

— w 
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co 
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^otetricauy 

The vectors 1 2 the ". M in n g ur. 3.2. 

felij; r especti vely< ’ iL are ihe projections of x on it 
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Figure 3.2 


3.2.REGRESSION MODEL OF THF 

F THE general linear hypothesis 

We shall now discuss a « 

linear hypothesis which is lllT ^ ^ 9eneral 

from experimental desicms. v Pria * f ° r anal Y sis of <*ata 

1. \ ° e a sequence of N 


independent random variables with 


E(X ± ) 


P 

= Z 
j = l 


8.x, 
J ij 


Var (x^) = a 2 . 


(3.8) 


1 — where 8 r , 

i=1 „ . . . 6 P are unknown quantities ana x 

:::i ti 

with independent covariance structure. In the vector notatL 

~ . .V - = (8 1.V the matrix 

tensions N x p, we can rewrite (3.8) as 

E(X) =X8 

E(X - xB) (x - x 6) • = o 2 i 

We shall assume throughout this chapter that Nip. lf the ran> 

** a the m ° del iS Called 3 m ° ael ° f ful1 ™«*. Linear 

ere the matrix x is » ot ot full rank is of common 
hrrence in experimental designs. In this connection the 
trix x is called the design matrix. 


x = (x i:j ) of 


(3.9) 
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In experimental designs the unknown quantities 0^,...,3 

are called effects and the known constants x^j are called 

countervariables which count the number of times the effect 3^ 

occurs in the ith observation x^. In most situations of 

experimental designs x^ usually take the values 1 or 0 indicating 

the presence or the absence of the effect 0^ in x^. It is not 

always that x^j take the values 1 or 0. The cases where x^ are 

continuous variables are referred to as models of regression 

analysis. The mixed cases, where some x,. are countervariables 

ij 

and the rest are continuous variables, are called models of 
analysis of covariance. 

The unknown effects 3^ may be constants or random variables. 
If all 3^ are constants we call them parameters of the linear 
model and the model is called a fixed effect model. If for some 
j» Xjj = 1 for i = 1, ...,N, then the corresponding 3j is called 
an additive constant. If a model is such that all 6^ are 
observable random variables, it is called a random-effect model. 
In a mixed effect model some 3^ are parameters and some are 
random variables. We shall consider mainly the problem of making 
statistical Inferences about the parameters' in a fixed effect 
model. 


3.2.1. Least Squares Estimate of 3: Let be a sample of 

observations on X^,...,X N . The least squares estimate of the 
parameters £ in the linear model is the value B = (B,,...,B ) 1 of 

“1 p 

£ for which 

S 2 = (x - xS) • (x - x£), 

where x = (x^, ...,x^) • , is minimum. Obviously given X, B. is a 
'function of x only. Differentiating S 2 with respect to 0* the 

_ a 

value B of £ which minimizes S z is given by 


N 

l 

i=l 


x., (x. 

lk i 


} ml Vij* ■ °' k ■ l * 




In matrix notation we rewrite 


(3.10) 


as 


X'Xj = X*x . 
The equations 

X^xj = X'x 



(3.10) 


(3.U) 


(3.12) 


• • 
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(3.13) 


are called normal equations for the linear model and it is a 
nonhomogeneous system of li„e ar equations in B. If X has the 
full rank p then X'x has a unique inverse (X'X)' 1 and the 
unique least squares estimate of g j. s given by 

B = (X'X) -1 X'x . 

In the general case the normal equations (3.11) always admit of a 
nontrivial solution, though, obviously it is not unique. This is 
because the vector X'x belongs to the column space of X- which is 
the same as the column space of X'X. If (X'X) - is any generalized 
inverse of (X-x), then a solution of (3.11) is given by 
(see Chapter 1) 


I = < X ' X > X* X + (I-(X'X) (X'X)")z , 
where is any arbitrary p-vector. 

2 

Lemma 3.1 . S 
(3.11) . 


(3.14) 


has a unique minimum for all solutions j[ of 


Proof. Let B be any solution of (3.11) . Then 
S 2 = (x - XJ3) ' (x - xg) 

= (x - XB + X(B - $)) • (x - xB + X(J - g)) (3. 15) 

= (X - xB) ' (X - xB) + (B - 6) 1 XX • (B - B) . 

Since the second term in the above expression is nonnegative, S 2 
attains its minimum value (x - xB) • (x - xB) when 3 = B and this 
minimum value is unique for all solutions j[ of (3.11) 

A least square estimate of any linear parametric function L‘{3, 
where L= (1^ ..., l^) • is a nonnull p-vector, is defined by L'j| 
where is a solution of (3.11) . Obviously Ij'JS is unique if x 
has full rank. We shall now investigate conditions under which 
a linear parametric function L'g admits of a unique least squares 
estimate in the general case. To do this we need.the following 
concept of estimable functions. In what follows we shall not 
distinguish between an estimate and the corresponding estimator. 

^ estimator corresponding to any estimate is obtained by replacing 
.£ by X = (Xj # ...,Xjg) ' in the expression for the estimate. 
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Definition 3.1 , A parametric function (or a parameter) is 
estimable if there exists an unbiased estimator of the parametric 
function (or the parameter) which is linear in the components of 

X. 

Such estimable functions are usually called linearly 
estimable. For convenience we prefer to call them simply 
estimable functions. The concept of estimable functions is due 
to Bose (1944). It is easy to observe that not all parametric 
functions are estimable. A condition of estimability of the 
parametric function is given in the following theorem. 

Theorem 3.2 . The linear parametric function L'j? is estimable if 
and only if the equations X'X\ = L in _X has a solution. 

Proof . If L'&_ is estimable then there exists a linear 
combination b'X,where b =( ...,b ) ' -is a real N-vector, such 

that. 


E (b 1 X) =b ' X6 = L'jJ for all £ . 

; 

Hence X'b = L. This implies that the rank of the vector subspace 
spanned by the column vectors of X equals the rank of the 
augmented matrix (X 1 |l) . In other words the rank of X'X equals 
the rank of the augmented matrix (X'X|L) . Hence X'XA = L in ^ 
has a solution. 


Conversely suppose that X'X _A = L has a solution for X. 
writing b = X A. we get b'X = L' . So b' X_B = L'jS for all £. 
This implies that b'E(X) = L'_B and hence L' B, is estimable 
and b'X is its linear unbiased estimator. 

Q.E•D« 

Comment . Since X'XX^ = L implies that X'b = L, the above 
theorem can be stated in terms of the condition that L is a 
linear combination of the column vectors of X'. 



We now state the conditions under which a linear parametric 
function L '_B admits of a unique least squares estimate. 

Theorem 3.3. A linear parametric function L'J3 admits of a unig1 
least squares estimate if and only if L'B is estimable. 





e 
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roof * If L'J3 is estimable 

#^ Theorem 3.2 # L belongs to 
column space of X'X. if -r(D ,( 2 ) ~ g 

solutions of (3.11), then (i<D m"!). 1 “* tWO 

column space of X'X. Hence «(U \tl “ the 

¥ which implies that L'^ u; = l ,j(2) " 

Conversely if L'B admii-«= 

- s of a unique least squares 

estimate it follows that any vector which is orthogonal to the 
column space of X'X is also orthogonal to L. This means that 
L belongs to the column space of X'X and helce L'B, is estimable. 

Another equivalent condition of estihability of the^near 

parametric function, which is easier to check, involves a 

generalized inverse of the matrix y<v ■ , 

c matrix.x X and is given in the 

following theorem. 

Theorem 3.4 . The linear function L'B is estimable if and only 

if L'tt-(X'X) (X'X)) =0 where (X'X)" is a generalized 
inverse of X'X. 

Proof. By Theorem 3.3 L'B is estimable if and only if L'B 

admits of a unique least squares estimate. Since a solution of" 

(3.11) is given by (3.14) and z is arbitrary, it follows that we 
must have 

k' (I - (X'X) "(X'X)) = 0 . 

(3.16) 

We shall now investigate the optimum properties of the’least 

squares estimator of any linear parametric function. Since the 
joint distribution of x Y . . 

uill s rlDutlon of x l. 1S not reified, in general it 

will not be possible to examine any optimum properties of the 

least squares estimator in relation to all possible estimators 

nstead, we shall investigate its optimum property among all 

lihear estimators, that is, estimators which are linear functions 

x r * • *'*n* 

asare^Ls. ( G a uss-Markoff ) . if the linear parametric function 

- is estimable then the linear estimator L'B where E i 
solution of - p. is a 

X ' x £ = X'X 


(3.17) 
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is best linear unbiased, that is, best (minimum variance) among 
the class of all .unbiased linear estimators of L'j$. 

Note : A best linear unbiased estimator is often called BLUE 

Proof . Since L'ji is estimable, from Theorem 3.2 we get 

X'XX = L for some _X . Hence 
L'B = X'X'Xj = V X’X . 

Therefore, < 

E(L'B) = E(X* X'X) 

= X* X' X$_ 

= L'B . . 

Let b'X be any other linear unbiased estimator of L'B . Since 
b'X is unbiased for L'jS, we get b' X = L' . Now 

Var(b'X) = Var (b'X - L']j + L'l ) 

= Var (b'X - _A' X' X + L'B ) 

= Var (b'X - X'X'X) + Var (L'B) 

^ Var (L'B) , 
as 

Covtb'X - 1'X'X, L'B) 

= ( b' - X'X') XX o 2 
= ( b'X - X'X'X) X_ o 2 
=(L' - L'U a 2 
= 0 .- 

Prom the argument following Theorem 3.2 and Theorem 3.3 it foll°« s 
that if L- 6 is estimable, the .unique best linear unbiased 
es imator of L'£ is L'jj , where jj i s any solution of ( 3 . 17 ). 

rnr^re^' t lf b) 6 llnear P aramet tlc function $ = . . 

are estlI ® b l a . any linear nomination of * is 

also estimable. ( 1 .*k 1S 

_ k 

P roof . Let $= z a <j, where a 

constants and let V 1:51 i! 1 ' . .\ real 

i e unique linear unbiased estimat° r 


of V 1 * 1 . . Then 1 K \ 

unbiased linear estimator f i=1 ** * 1S obvioX3sl Y the unique 

° r • Thus 4 i is estimable. 

It is easy to conclude *w . k Q - E - D - 


^ iS easy to conclude that * . * , . Q ' 

| is any solution of ( 3 . 17 ) i=l ** L 'i^' where 

Theorem 3.7 . All linear M >- 

if and only if x has full raxfc ^ fUnCti ° ns in £ are estimable 
Proof , if x has full rank i-h~ 

the unique solution 6 = ( X i X )-l , normal equations ( 3 . 11 ) have 
estimator of L'B for any a Hence —'1 is th e unique 

then by Theorem 3.2 l = X'yi Conversely if L'J3 is estimable 

X'X must be of full rank * SOne Since £ is arbitrary 

If X is not Of full rank c . . Q.E.D. 

not admit of unbiased linear est^to^^ fUnCtl ° nS dc 

up nothing can be inferred about "them” Lin ^ Pr6Sent 

are not e “ •»—- - '•c:::ir e n: s r tion£ 
. zr t :r riuinir t r so as to - 

L . * full rank. Such a reparametrization is 

r ainly aChleVed by means of the transforation a = aB. where A is 
a matrix of known elements such that each component of a is an 

estimable function of 8 . This will be discussed in more detail 
in the experimental design model. 

Let r be the rank of X and (X'X)" denote the generalized 
inverse of X'X. Since E (X) = XB, all parametric functions X£ 
are est imable and hence, by Theorem 3.3, x^ admits of a unique 
least squares estimate x£ , no matter what solution B of 
(3.11) we take. Let us take in particular 

I - (X'X) X'x . (3.18) 

From the identity 

E(X 1> = E (X (X'X) "X'X) 

= X(X'X)”X'Xe 
k = X 6 


ice 


lows 



.£# it follows that we must have 
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X(X'X)“X'X = X . 


(3.19) 


(3.20) 


An estimator of a based on the unique least squares estimator 
XB of xj* is given by 

° 2 = (x-xB) • (x - xl) 

= 7ibl7 x' d-x(x*x)’x') (i-x(x«x)"x')x 

= 7S"_r) ( I -x(x'x)"x , )x , 

as by (3.19), 

(I-X(X'X)‘x') (I-X(X' X) “x 1 ) 

= i-xtx'xl-x*. 

In view of (3.21), the N x n matrix I-X(X'X) - X' 
and its rank is equal to 

tr (I-X(X'X)“X) = N-r . 

Since 

X' (I-X(X' X) ~ X') x = o, 
wte can write a 2 a s 

° 2 = 7 N -r) ( X" x i) ' (I-X(X'X)-X')(X-X3) . 

10105 —a 2 is an unbiased estimator of a 2 . 

Proof. 


(3.21) 
is idempotent 


(3.22) 


E (a 2 ) = 


J N ~ r ) ° tr(I-X(X'X)“X') 


_ N^r a 2 = 2 

N-r 

In what sense is o 2 an optimum estimator of a 2 beyond being 
unbiased? This question was examined by Hsu (1938) and Rao(19! 
Hsu considered the quadratic estimate of the form X 1 AX, which : 
unbiased for a and which has minimum variance among all 
quadratic unbiased estimators with variances independent of £ 
too also considered unbiased quadratic estimators of the form 
X'AX which have minimum variance among all unbiased nonnegati* 
quadratic estimators, under either condition it has been show 
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that o haS tlle minimum variance when the Pearsonian coefficient 
of Kurtosis of the distribution of each is equal to 3.. In 

the case when this coefficient is not equal to 3, "a 2 has 

minimum variance when the coefficients of x 2 , .,.,X 2 in X'AX are 
0 all equal. 

If X'X has full rank, then, 

d 2 = In z' d-xtt'xr^Mx. 

Let L' Li 1 ^ be two estimable parametric functions and let 
L' L' 2 ^ their least -squares estimators. 

Lemma 3.9 . 

Var (L ' jjS) = 0 2 L ^(X'X) "L 1 , 

CovtL*^, L' ^B) = o 2 L' ± (X'X)"L 2 . 

Proof . Since L'.jj? is estimable, L^ = (X'X)_A for some . 
By (3.18) 

Var (L* 

• . = Var(L' (X'X) - X'X) 

% 2 ' 

fj = o L'-^X'X) (X'X) (X'X) L 1 

= a 2 A/ (X'X) (X'X) - (X'X) (X'X) ”L 1 

= a 2 X> (X'X) (X'X) ~L 1 

= a 2 L ' (X'X) - L 1 . 

c °v (L'^, L' 2 ^) 

= o 2 L' (X'X) "(X'X) (X'X)“L 2 
= a 2 V(X'X) (X'X)“ (X'X) (X'X)"L 2 
= o 2 V X'X (X'X) "l 2 
= ° 2 L' x (X'X) "L 2 . 


k 


If (X'X) is of full rank, then all parametric functions are 
e stimable. In particular, all components of J3 are estimable and 
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VarlB^ = o 2 Cii , Cov(B., B .) = o 2 C., 

j J 


<3.23) 


where 


C =(C ij ) = (X ' X) 


-1 


^•2*2. Geometrical Approach to Least Squares; Let P be a 
projection operator (matrix) which projects vectors on the column 


space of X. By the definition of projection we get PX = X and p j 



is a symmetric idempotent matrix. 


Let b'X be an unbiased linear estimator of L'j3. Since 

E (b X) - L' for all b'X = L' . Consider Pb and the linear 
estimator (Pb)'x=b'PX. Now 


r Thus 


E (b 1 PX) =b'BXB_ 

=b'X£ 

=L'i . 

Var (b'X) = Var (b" (I-P)x + b'PX) 

= Var (b* (I-P)x) + Var (b'PX) 
* Var (b'PX) , 


(3.24) 


(3.25) 


as 


b'PX= 


Cov(b* (I-P) X, b'PX) 

= 0 ^' (I-P) Pb = 0, since P is idempotent 


3.2.: 
jjstji 
an (j. 
rtiere 
w e sT 
unbi- 
Thi s 

H 

K 


\ 


Hence b'PX is unbiased for L'B . and has smaller variance than 

b'X. If a 'X is any other unbiased estimator of L'ethen 

- ci’lx = 0, that is, b - o is orthogonal to the columns of * 
and therefore P (b - a ) = 0. So 


it 


a'PX = b'PX 
Hence 


\ 

^[\i 


: i 


Var(a'X) ^ Var (a'PX) = Var (b'PX) 

If XX is the projection of the vector b on column spaee X 
X' (b - XX) = 0, 

that is. 


(3.2«> 


. N 

| N 






\ 


li 
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so we can choose the projection matrix p as 

p = X(X'X)"X' . 

Hence 

= X(X'X)-X'b l 

■ = X(X'X)~X'L , 

b .px= L* (X'X) ~X'X 
= L '|_ . 

Thus b'PX is the least squares estimator of L'£ . 

3 . 2 . 3 . The Best Linear Unbiased Estimator and the Least Squares 
Estimator for Correlated Observations : Let X = (XX )' be 
an N-dimensional random vector with E(X) = x£, Cov(X) =a 2 V, 
where V is a symmetric positive definite matrix. In this section 
we shall investigate conditions under which a best linear 
unbiased estimator coincides with the least squares estimator. 
This model is a slight generalization over the linear model (3.9) 
where V = I. We shall first consider the case where X has full 
rank and then we shall discuss the general case. 


Since V is a positive definite matrix we can write 


= BD' , 


(3.27) 


where D is a nonsingular matrix. Writing X* = D~ x 

E(X*) = D _ 1 X_3 , Cov(x*) = a 2 i. (3.28) 

( From Theorem 3.5, the best linear unbiased estimator 
of J3 , assuming X has full rank, is given by 

| = (X'V _ 1 X)' 1 X'V' 1 X , (3.29) 

with covariance matrix 
Cov(B) = a 2 (X'V - 1 X) _1 . 

The least squares estimator B of £ is 

B = (X'X)” 1 X'X (3.30) 

^ with E(B) = J3, Cov(B) = a 2 (X'X)" 1 (X'VX) (X'X)" 1 . 

^ Let us now investigate conditions of equivalence of B_ and 
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Madness and McGuire (1962) exandned the simplest ease p = 1 that 
is, 6 is a scalar quantity B, with the added restriction that 
X«x = 1. For simplicity they took Var(X i > = 1 for all i. so that 

is the nonsingular correlation matrix R of X. Let 6 X . 

be the characteristic roots of R and let t ' le 


corresponding normalized characteristic vectors 


So 


Ra. - 0 4 a 

—l — 


^ i i — 1# • • • t N # 


ou ' a ^ = 0, i ^ j . 

Since the set of N orthonormal characteristic N-vectors c^#...#^ 
constitutes a basis to the N-dimensional vector space# we can write 
N 

X = 2 C. a. 


i=l 


1 —1 


. § 


C. = 1. 


i=l 


where are scalar constants. From the condition that 

-*■ W Mo 

X'X = 1 and ct^'s are orthonormal vectors# we get 
Thus for the best linear unbiased estimator § , 

Var(B) = (X'R -1 X)" 1 

N N . 

= [( l C a ) 'R _i ( £ C. a KT 1 . 
i=l 1 1 i=l > ~ 1 


Since cu are also the characteristic — -c "-l 


. ^ 

.c vebtors of R” 

to the characteristic roots 6' 1 , i = l.N, that is, 

— 1 — 1 * 

R ot^ = 0^ —i ' we have 


N . N C. 

1 c i R “i - S ( -ir > “■ 
i=i 1 1 i=i -1 


So 


Var(B) = 


N 

£ 


-1 


C/ 
1 

i-i *7 



For the least squares estimator B of $ 

Var(B) = X'RX = £ c 2 e 

i=l 


-i . 


Generally Var (B) s Var (Hi 

holds is given in th * 6 Con<iLtion under which equali^ 

fe *N6fe*.,. 0 ° llowing th eorem which is due to Matfnp* 5 
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w GUire (1962) . 

■tuess m 3 - -^- Var(g) = v «( 6 ) if and only if x is a 

characteristic vector of R. 

P proof,. If X is the characteristic root of R corresponding 
to the vector X, then, 

R ' 1 X = jXa RX = XX. 

So 

3 = X'X = B 

and hence Var (B) = Var(p) = \ m 
Conversely, suppose that 
Var ( 6 ) = Var ( 6 ) = X. 

Let X 1 be any unit vector perpendicular to x. 

Then 

RX = Xx + yX 1 , 
or 

Xx = RX - MX 1 , 

where X,y are scalar constants. So 

' 

X'RX = XX'X = X > 0. 

Now, x = XR _1 x + UR~ 1 x 1 


So 


R_1 X = jX- H R -1 x 1 


X'R” X x = X - X X' R~ 1 x i 


Hence, 

x = Var (B) = (X'R" 1 x)" 1 


= X (1-yx' R~ 1 X 1 ) ~ 1 


(3.3i: 


Since X > o we conclude-, from (3.31) that either y = o or x'R'V 
= 0 * But X. perpendicular to X implies that y = o. Hence 

Tiw »_ - 1 


R X “ *X and R x = X/X, which implies that X is a characteristi 


sector of R corresponding to the characteristic root X 


Q.E.d. 


In the general case with E(x) = XB, Cov (x) = o 2 v, where V 
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is a symmetric nonnegative definite matrix and! X is not of 
rank, Zyskind (1967) proved the following theorem. 


Theore m 3.11 . Let r be the rank of X and let L'B_ be estimable 
The best linear unbiased estimator of L'£ coincides with its 
least squares estimator if and only if there exists a subset of r 
characteristic vectors of V which constitutes a basis of the 
subspace C(X) spanned by the column vectors of X. 


P roof . Suppose that V has r orthogonal characteristic 
vectors which form the basis of the linear subspace spanned by 
the rows of X. Let M Q be the r x N matrix whose rows are these r 
orthogonal characteristic vectors. The other N - r characteristi< 
vectors of V form a basis of the orthogonal complement of c(X) 
and let be th4 (N — r) x N matrix whose row vectors are 


these N - r characteristic vectors. ,Obviously M^X = 0. 


Let M = 



and let Y = MX. Then 


E(Y) = 


(M Q XJ3) , 


Cov(Y) = o 2 MVM' = a 2 A, 


where A is a diagonal matrix whose diagonal elements are the 
characteristic roots of V. If the rank of V is less than N then 
some of these diagonal elements are zeros and the others are 
positive. Thus the best linear unbiased estimator of any 
estimable parametric function L'B should only be a function of 

**)*■ 


Let b'X be any unbiased estimator of L'£ and let b = a+a’ 

where a 1 is the projection of b on C(X> and a is the orthogonal" 
complement of a. Now 


E(b'X) = E (a’X) + (a 1 ) *X) 

= E(a'X) + (a 1 ) 'X3 

= E(a'X) . • » 

Thus a'X is also an unbiased estimator of L'6. Again, since 

M o 

Ma =( 0 ) a 
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M o £ 

= ( 0 ) , 


1 / 
Ma = ( 


M 1 £ 


Vai-(b'X) = a b' V b 


2 s 

= a (a+a 1 ) ' v(a+a x ) 

- a (a+a A ) , M'DM(a+a 1 ) 

= ^ta’M^Ma + (M^a 1 ) -D^a 1 )] 
* ° 2 a.'MiD 0 M 0 a 
= o 2 a' Va 
= Var (a'X) , 


where 


D = 


Thus the best linear unbiased estimator of L'B is a'X, in 
which a belongs to C(x) . Moreover since m q is a basis of C (X) 


a'X = a' M^ m q X 


= E' (MqX) , 

where E = M Q a . Finally since C (X) = C(M Q X), the best linear 
unbiased estimator of L'B is the least squares estimator which 
is the unique unbiased estimator of L^B f or which LeC(M Q X). 

Conversely, suppose that the best linear unbiased estimator 
of the estimable parametric function L'B coincides with the leas 
squares estimator. Let M = ( “° ) be an orthogonal matrix such' 

that M 1 X = 0. Let Y = MX. Then 


E (Y) = ( 


M 0 XB 


Ce ^' ^0— ^ es t linear unbiased estimator of 


m 
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HTHt 

fe ft 


/v) is Of the form 
c£. This implies that Cov _ 


,\ qrruares estimator of M Q X3 is best linear 

otherwise the least sq matrices such that 

. , . T P t r R be orthogonal matrices s 

unbiased. Let k q /K 1 


R o w o R o = A 0 ' 


R i w i R i = A 1 ' 

where A Q , A 2 are diagonal matrices. Let 


Then 


R o 0 


PVP 1 = 


Thus the column vectors of P form a set of N characteristic 
vectors of v. The row vectors of ere linear combinations 

of those of M 0 and similarly those of R M, . Hence R M 

Vl form respectively orthonormal bases of c (X) and its 
orthogonal complement. 

'• 2 ‘ 4 - —- St ^"-r- Estimator Sjfcject to Linear n„ tr1 
tions on Parameter^:. Let x hav ' -:-~ riC - 

distribution with mean « c^i 

£ ^ subject to consistent linear restrict ° * ^ 

with rank c = p-t. suppose that a general “t" C ~ = - (9iVen) 
ls £ + Bq, where B° is a 9 1 Solution of C'B = 6 

o/Li^rr r ution ° f u 

*" arbitrary Of dimension r 1 C ' B = °< •»« « 
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B (X) 


or 


= x£ 

= X(£° + Ba ) 


(3.32) 


£ B (X - X£°) = XBa . 


M 


Pi 


I 

# 


Since l = X - X 6 has an N-dimensional normal distribution with 
M an XBa an covariance matrix o 2 I, the problem of estimating 

A f ar is reduced to that of consiripH ^ 

- considering new variable Z and 

estimating new parameters a, cr . “ 

For the estimation of the linear parametric function L '6 

* £' (fi + B £> and ° need to know the rank of XB. Sinle" 

C'B - 0, the rank of the augmented matrix <x|c') satisfies 

rank (X |C ') = rank(XB) + rank (C) {3 33) 

that is, the rank of X B can be obtained by studying the matrices 
X»C without actually computing B and XB. 

An alternative approach which avoids such direct verification 

of ranks is as follows: Consider the problem of finding a linear 
function 

Xk b'X + d , 

? / 

such that 

Var (b ' X + d) = a 2 b ' b 

7 7 “ - (3.34) 

is minimum for all choices of b,d, subject to the condition that 

E(b'X + d) = b'X£ + d 

” " ' (3.35) 

and C -6 = 0 . 

Write d = M'6 . Noting that C'B = e , the problem is reduced to 
minimizing b'b subject to the restriction that r 

b'X + M'c* = L' , 




li 


(3.36) 


or 


(X- C) (|) = L 

M — 


Theorem 3.12. Let 




Ill ■' 


lr ! < 


4 


X - t " C ' = * he corre‘r° n * m _ eX ist column 
. xcib' *°* Ch ncl uae there 

where b w e con 

From (->• 

Pji2£~ m , y.at 

f such that 

vectors 

L „ (X'X)E + c '£ 

0 = CE - 

With 

b*= XCjL, w 2 


we get 
(X* |C) 


XC^ L 

C' L 
. 2 ~y 


(b*) 1 (b*) = L'C^'XC^L - L'C^ . 

Mote that if A' is e generalized inverse of A then A'AA = A 
Furthermore for any b satisfying (3.36) (b*) 'b - L'C^ 

Now by applying Cauchy-Schwartz inequality we obtain 


t (b*) 'b ] 2 < ((b*) -d*} (b >b) , 

nr 


(b 1 b) > 


r(b*) >oy 
(b*) 'b* 


- L'C^L . 


Ihus the minimum variance Unbiased estimator of L ' f. is ^ ^ 

(b*)-x + (m*) • e = L' (CjX'X + c 2 :-) 

which can also be written as l'& where t ■ . 

X'x^ + cX = x'x — ls ^ solution, of 


subject to the restriction C ' : s .. 

• 

Observe that (3 3 q, „ u 

S' «) sub Ject t0 ; ne bS 00ta ined o y flmriz ; 

™ itipu - b - sinc r— ci ^-. h Sine 2Jti. 













jjp^V Hi* 


■■ 



* 
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| - & ,c i x ’ x + ~' C 2—■ ' 

ffoni (3.37) 

V ar(Ii‘£) = a2 £ , C 1 X'XC 1 L 


= cr 2 L ' C x L . 


(3.40) 


Similarly for linear parametric functions , 

Cov(Lj£, ) = a 2 L^C 


- ° 2 ii2 c iii • 


(3.41] 


In the case where the matrix c i is not computed but it is known 


that 


LJ3 = I'X’X , L^e. = i'X'X , 
satisfying the condition that 
C'Y = C' 6_ = 0 , 


then 


Var (L}£) = a 2 L^/ Var (L'^B) = £ 2 L^6 

Cov(L^£,L^|) = cj 2 L^6 = a 2 L^x • 

If C'x = C'6 = 0 is not satisfied then 

i§'\ ■ * - - 

jf- Var (Lj£) = a 2 x'X'Xx , Var (L^B) = a 2 j 5 'X'X^ 
'' Covtt^B, L£B) =0 2 x'X'X6. . 


(3.42 


(3.43 


(3.44 


(3.4! 


3.2.5. Least Squares in Random Effect Models; Consider the 
linear model (3.9) where £ is a random vector with mean u and 
covariance matrix T. We shall be interested here in finding the 
best linear estimator (predictor) of the random variable L'B 
Since the conditional covariance of X given B. is 

Cov(x|3J = o 2 I , 

we get 

Cov(X) = E(Cov(x|£))+ Cov(E(x|S)) 


=, 0*1 + Cov(x|6) 
= 0 2 I + XTX 1 , 


Cov (X/L'B) = E (Cov(X,L' 3 jB.) ) + Cov(E (x|jS) ,L'B.) 


= 0 + XTL. 
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, r t ig up look for 9 linear 
To find the best linear estimator of L £ we ( 


function a + b'X satisfying 
E (a + b'X - L'£ ) = 0 , 


Var(a + b'X - L'8) is minimum 
for all choices of a,b . 


( 3 . 46 ] 

(3.47) 


Theorem 3.13 . If £ is unknown, the values b*, a* of b, a satisfyi n 
(3.46) and (3.4 7) are given by 

b* = X(X'X) _1 L , a* = 0 (3.48) 

provided there exists a b such that X'b = L. Also the 
prediction variance, that is, Var ((b*) 'X-L'j}) = L* (X'X) _1 L. 

Note. If (X'X)~* does not exist it is replaced by a 
generalized inverse (X'X)“ of X'X. 

Proof . Since 

X'b* = (x*x) (X'X) _1 L = L 
we get 

E((b*) 'X^L'8) = (b*) * Xy - L'y 
= L'£ - L'y 
= 0 . 

Since, for any other choices a,b satisfying (3.46) and the 
condition that X'b = L , 

Cov(L'£ - (b*) 'X, a + (b - b*) -x) 

= Cov (L'jB - (b*) 'X, (b - b*) • X ) 

= (b - b*) ' XTL - a 2 (b _ b*) 'b* 

■ ( -' X - ( t*> ' X > T k - ° 2 (b - b*) 'X(X'X)- 1 !, 

= 0-0 ~ 

= 0, 

we get 


Var (a + b* x - L' 3) 

= Var((b*).x- L. i + a + , 






- var«b*)' ? - L.e, + ^ Unc* 

Varllb*) *x - L'£) +Var ^ + <fe-b.,, s 
2 Var ((b*) ' X - l- 6) ^ ~ £*) 'jy 

/, o 2 (b*) ' (b*) 

= cj 2 L’ (X' X) ~ . 

It is interesting to r, a. 

y uo note th 

prediction variance are nat the best .... 0 

and its variance re s „ B „ 1 . 6 . Sa,e as the , the 




at *s 


t1Yp QTHes»s n 5 4 


QaE.D. 


and its variance resp**^ “» as the Ua^ a '* the 
parameter . Y> «he n b is tak qteres estimator 

In case u is know t0 ** 9 £ited 

ii is known the ot) ti„ 

| &* = (° 2 I + XTX'I'Vl c hoices of p ana a are 

a* = ji'L - _y_' X'b* 


and the prediction variance is U ' 491 

Via* + (b*> .x - L'BJ . Ij'TL - L'TX’b* 

= -■ (T ’ 1 + X '( d 2 r 1 x)- 1 L ( 3 . 50 ) 

M- 2 - 6 ; Llkellh °°d and Least 8^7 1 , 

hypotheses about linear parametric fmTT-' tes ting 

distributions of least souares <- ° 10tlS and flnding the 

j-easr squares estimators of diffor^*. 

we need now to specify the distributions of x v '“T"**" 

i shall assume throughout the remaining part of theater that 

| X 1'**" X N a re independently distributed normal variables with 

'? neans and variances given by ( 3 . 8 ) . W likelihood of the 

observations is then given by 

L( £' a2 ) = (2 tt) ~^ N (o 2 ) exp {- -^x(x-xB) 1 (x-X8) 1 . (3.5 

2a z ~ “ 

Taking logarithms we get 


(3.50) 


(3.51) 


Tog L($,a^) = _ loa(2fr)- ^logo^- -“^(x-XB) 1 (x-XB) . (3.52) 

• ’ 2a 

Differentiating (3.52) with respect to the component B^of B and 
• t ^e maximum likelihood estimates 3 of S and a of a 
hlyiven by 





m 




■Pb* » N * Lr5IS ,, „ 

a. ^* 53 ) 

<X’X)jL = X '* ' 

;2 „ i (x-x£) ■ 

1 *, (I -*<X’X)-X')x . under the assumption 0( 

• „ likelihood estimate ° _ quares estimate. < 

Hence the ®»” h same as its inverse (X'X)" 1 and 

normal distribution « «* has the unique m 

■ MX -as f»H «*' <X * 

,,„,v.- 1 X'X) = tX ,X >' X ' X£ = 1 ' - 

E (« = E((X X) „ nd E <£> ^ A' , 

,n the general case £ is -* u “ 9 ] 

and from lemma 3.8 

A 2j _ N-r 0 2 

1 , ", under the assumption of norsml distributions for 

jheorem j ^ likelihood estimator (also t e eas 

X 1.V . * tha estimable linear parametric 

squares estimator) _£ ±2 d L ,g has normal distribution 

function £•£ is independent Pf a an has the chisquare 

With nean L'£ and variance L' (X'X) L, no /a 
distribution with N-r degrees of freedom. 

Proof . Let A = I-X(X*X)”X,'- , B = L' (X'X)'X' . Then L'£=BX 

and Na 2 = (X - X£) • (I-X(X'X) ~X') (X-X&) . Using (3.19) we have 

BA = L'(X'X)“(X'-(X'X) (X'X)"X') = 0 . 

* 

Hence by an argument similar to that of Theorem 2.9 we conclude 

A A ^ 

that L'£, No are independent. As X has an N-variate normal 
distribution with mean X£ and covariance matrix a 2 I and L'£ is 
estimable/ using Lemma 3.9, we conclude that L'£ = BX i7 
normally distributed with mean L'jJ and variance L' (X~X)~L. 
mthe^re sisoe A is idempotent of rank N-r, Z, Theorem 2.7, 
s distributed as chisquare with N-r degrees of freedom. 

SSZSgZSU.. If X has full rank p, then B n" 2 Q * E ‘ D ‘ 

and! has the p-variate normal distribute 1 ' 316 independent 

covariance matrix o 2 (X'X)“ 1 m" 2 / 2 ° n Wlth 1116311 1 and 

chisquare with N-p degrees of freedom ^ diStributed as 


..... '^W 

proof .. Since x has f uU r A * Hyp °THESis i 15 

unique inverse (X'x)' 1 ar ,/ ank p * th en D * 

1 d p ^trix x'x k 

y'y x has 




1°* '* fc 


r 


(X'X)" 1 X'X . 


= (X , X) _1 (X'X) B 
= 0 , 


COV (3) = E (0 - S) (6_ - 6) ' 

= E((X-X) ^'X - 6) ((X'xl-^.x . », , 

= (X'X)" 1 X*0 2 X(X'X)“ 1 ~ £ 

= cr 2 (x* X) ~ 1 . 

Since (X'X) X X' is a p x N mtr±x Qf 

has the N-variate normal distribution vsi-k P (p < N) an * X 

covariance a I, it follows from Theorem 2.5^hat ^ T 
p-variate normal distribution with ^ 0 £ has a 

matrix o 2 (X'X) -1 . £ and covariance 

In this case 

*2 X - X0 x - yr 

N|_ =(^_ -) . (I-X(X'X) -1 (-- Z_)) 


I 0 

: Since 2 I - XIX'X)- 1 *' is an idempotent matrix o£ rank 

»-P, NO /o is distributed as a chisquare with N-p degrees o 
freedom. The independence of «_ 


«- P# ,o IS distributed as a chisquare with N-p degrees 
freedom. The independence of 0, No 2 follows from the fact 
X'(I - XU’XpV) = 0 . 


Q.E.D. 


Corollary 2. If x has full rank p , 

A 

a then 8^ is normally distributed with mean 6. and 

' ar( 8J = C. . o 2 , 

1 11 ' 

: °*(6.,a) = C n 2 

1 V C iJ ° ' 


'here 


(X'X)- 1 = C = < Cj J . 


(b) J 

Ik P ° r an ^ 1 * near parametric function L'JS* the maximum 

el ihood estimator L'B is normally distributed with mean L £ 

Varian ce o 2 l' (X'X)" T L. 




f ,,6 analysis of variance 

The proof is obvious. ^ indepe ndently distributee 

Example 3^ . ^t Y i'‘**' Y N 
normal random variables wit 


EfYJ = a+ 6x i ' 
2 

Var (V,) =. a • 


are known c 


constants (at least two 


i = 1 , where x 1/ ...^ x N 2 un Xnown parameters. 

a xr *.rk V aro distinct) and a, 3/ a 


of which are 


Here 


X = 


1 , X. 


1 . 


X'X = 




(X'X) 


I Yl ~ 1 - 


N— 1 

N J 1 (x.-5) 2 - 

\-Nx , N/ 


- 1 

where x- 'n ^ * The maximum likelihood estimates of a, (3, 

o i = l , 


2 

are 


S ( x -5)(y -£) I (x . 5)y. 

= _i = l 1 1 


2 (x - x) 

i=l 1 


E < x , - x) 


= y - 3x , 


?= N * *i 

1=1 1 


1 - 1 r v 7 { - Z (x i “ x ) y ) 2 

n i (y^y) 2 — 

2 (_ x) 2 

' ^ corres P°ndlng esuLtL 
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inear hypothesis \ 17 


E(0) = 


l (X. - x) E (Y. - ?) 
i=l i i ' 


Z(x ± - x) 2 


- 8 , 


(a) = a+ 8x - 8x = a. 


Var (3) - N 


O * _2 2—9 

N : ' Var (a) = £- + 2 _x_ 

v / “\ 2 N m 

Z < X t - X) • . r,2 


I (x.-x) 
i= 1 1 


Cov(a, 8) - - 


a 2 x 


I (x.-x) 2 
i = l 1 


a E (c ) = 


2 ) _ (N-2) a 4 
N 


Example 3.4 . Consider Example 3.3 with 
E(Y^) = ot+6(x i - x) 

Var(Y i ) = ° 2 , i = 1.N . 

It is easy to check that the maximum likelihood estimates 
(which are the same as the least squares estimates) of a, B are 


a = y # 8 = 


£ (x.-x) (y.-y) 
i = l 1 1 

N - 2 
l (x. — x) 2 

i= 1 1 


\ i=i 1 ' 

A A A 2 

The corresponding estimators a, B, o satisfy 


E(a) = a, E (0) = 3, 

E( ; 2 , = ^ a 2 , Var(a) = £ . 


^r(B) = 


“ _ 2 

l (Xj — x) ^ 


, Cov(a, B) = 0 . 


Lon of a random 


T „A«LYS. 5 OF « R '«“’ ^ 

3.3. PERCENT POINTS AND tribub ion of a random 

..percept point. ° £ £ ra „dor» variable svch 

“* “T x is the valve *!_„ ^ tW t point of the 

variable X The lower «-P (J < x ) = o. tV. 

*■« Tt on ^“Ihe value * a •** abo ut the origin ^ \ 

ZZl» if «- distribution is sy ^ ^e the upper arus K 

x . The values X 1-0#n ' " e distribution with n 

XCl V^rcent points of the chisq we will use the | 

lower a-percent p 1v Similarly I 

degrees of freedom respectrvely. , to 3en ° te «i. I 

a. t an ° ' 1 A . , [ 

notations ^l—oi,n a,n the t—distribution (central) 

upper and lower a-percent pC ^ nt ® ^distribution with n r n j 

with n degrees of freedom an diff erent a these Val tes 1 

degrees of freedom respectively. j 

can be found from statistical tables. 

UC&1A _me A \ -.4- T CT O QV 4- ^ 



>e rOUUU - . L 

. • iq 74 ) it is easy to 

From the F-distribution (see Gin, 


verify that ■ 

. (F l' 1 (3 - 5 6)l 

T l-a,n 1 ,n 2 a ' n 2 ' n l 

, • =v,d the t-distribution can 

The percent points of the chisquare and the 

also be found from the F-distribution by using the relations 


(n 


= n F 


a,n,‘ 


U.57^/ 

(3.58)1 


= (F )*> Vi.b8)l 

W2,n 1 a, l # n ; 

jjQ-fc x be distributed as a normal N—vector with mean and 
covariance as given in (3.9) . To find the confidence interval 
of 8, , assuming that it is estimable we use the fact (3.54) 

^ A 2 

that 8 k is normally distributed with mean and variance c ^° * 

and 

j *2 (X - x£) (X - XB)‘ 


A 

iere 3 satisfies (3.53), is distributed independently of B-> 

1m m m __ * - _ ! a |_ . 


• -- — —wv.ij.vui.cu iiiucpenugntiy ui 

chisquare with N-p degrees of freedom. Note that the assumption 

estimability of each component of £ implies, that X is of 
^ p. Therefore 
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,-p)* <vv 


r f has the Student t distribution with N-p degrees of freedom. Since 
the Student t-distribution is symmetric about the origin we get 

In n \ 


(N-p) ^ (B -ft ) 

p{ ^a/2, N-p * - T— * t 

> Sk 0 


a/2, N-p 


= p{B k _t a/2, N-p 


= I-a. 


" 5 * “ 2 V 


zrr e * 5 ** + t «/2.N- 1 




(3.59) 


Thus the (1-a)* confidence interval of ^ is given by 


B Jc _t a /2,N-pl 


NC kk a 


2 \H 


" /nc S 2 \^ 

' + v 2 ,n- p (— h 


(3.60) 


V >. For an y estimable linear parametric function L'£, by Theorem 
% 3.12,' 

(N-r)* 5 (L' 3. - L ^3.) 


nr. 


N/Ncr L(X'X) L 

has the Student t-distribution with N-r degrees of freedom. We 
are assuming tacitly that the rank of X is r. Proceeding in the 
same way as in (3.59) and (3.60), the (1-a)-percent confidence 
interval of L'3. is given by 


(t'B-t ( nS 2 L'(X'X)-L h 

*• — ft / T M _ \ J 


"a/2, N-r 


>'• ■ i-i * Vi'”- 


in the 


case where the rank 'of X is p, N-r should be replaced by 


N "P and (X,X)’ by (X'X) -1 . 

Let us now consider k linearly independent estimable linear 
^metric functions 

01 s -i£.<f> v = l* £ 
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which in matrix notation may be written as 

± = L'0 , (3 * ( 

vhere ± = «fc> 1 » L' is a k x P matrix^with rows 

L^, i = 1, .../k. The least square estimator ± of 4> is given 


i = (J r ...♦V ' = L£ 

where £ is given in (3.18) and <p ^ Obviously 

E(^) = £ , 

and from Lemma 3.9 

Cov(^) = a 2 (L' i (X , X)"Lj) = o 2 V (say). 

From Theorem 3.12. corollary 1. 

* 

A 4 A 

(1 - £) ' V~ x (± - ±) 

~ 2 2 '"‘2 2 

xs distributed independently of Ncr as x> and No /a is 

2 K 
distributed as X N r * Thus 

(N-r) (_£ - _£) 1 V -1 (4 - 4 ) 

' ”2 ' (3 

k N 0 ^ 

is distributed as a central F with (k,N-r) degrees of freedom. 
Hence 

(N-r) (£-£) 'V -1 (£-£) 

p { - 7r T~ - < F. , } = 1 n 

k N a 2 ■ ~ l-a,k,N-r J 1 “ a ’ 


(3.6 


l-a,k,N-r 


} = 1 -a. 


The set of all £ satisfying 


(N-r) (£-£) ' v _1 (£-£) 


k N o' 


- F l-a,k,N-r 


for given 4 , o 2 is an ellipsoid with centre 2 e ■ 

U-a)-percent confidence set of 4 . Note th “ 13 Called th< 

that this random ellipsoid covers -the true friability 

1 -a, no matter what the values of ,,v Parameter point 4 ii 

3 ° f Unknown Parameters B,o 2 are- 
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UN I VAR I ATE 


3 . 4 . TESTS OF HYPOTHESES In THE REGRESSION 


MODEL 


General linear hypotheses are hypothec: 

regression model (3.9) „ ith the . conce ming parameters £ of 

, are independently normally distributed^ith 1 ^ 10 " ^ Xl . Xs 

• as given in (3.9). The technique wh • „ meanS *** Variances 

these hypotheses is called the anal 1C ^ lr '° Stly USed in testin « 

is sometimes abbreviated as ANOVA V “ ianCe techni< 3 ue and 

down the total variation into ^hj“ 1I,VOlVeS ^ 

• r\ , °gonal components representing 

variations due to different cauco 

c puse factors. Let x = (X X. ) • 

be normally distributed with f/y* “ ,1 '***' n 

^eawithE(x) = XB and«Cov(X) =a 2 I. We 

shall consider two different r a f onn .. , " 

rent categories of testing problems. 

Under the first category we wi ii j _ 

y y we win consider the following two: 

(a) To test the null hypothesis H -s- ft = - (o.° a°\ , 

where £ is specified and a< is unknown. P 

(b) To test the null hypothesis H : ft = 6 ° v - i 

o o 9 ^ — ^ • • • • $ Q < p 

when •..« and a are unknown. 

For these two problems we shall assume that all ^ are estimable. 
In other words X is of full rank p. 

Under the second category we shall consider the following 
two testing problems: 

(c) To test the null hypothesis = 6 where 6 is a 

specified constant and L'jS. is estimable. 

(d) To test the null hypothesis H q that k estimable linear 
parametric functions (Jr have preassigned specific values 6 ,. 

In th e notations of (3.60) and (3.61) 

V i = i 

where 6 = (6 1 .6^) • . 

Likelihood Ratio Tests : We shall use the likelihood test 

Principle to derive the test statistics in the first two cases. 

In the other two cases we shall give the commonly used optimum 
tea*. 

^ procedures. 
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Case (a) . The likelihood of the observation vector x 
= (x r ...,x N ) • is given by 

L(xj3, o 2 ) = (2 t ro 2 ) _N//2 exp {- ——j (x-X3) ' (x-X_§) }. (3.65) 


20 


Given x,L is a function L(_3,o ) of J5 and o only. The parametric 
space ft is the p + 1 dimensional space 

ft = {($, o z ) : - oo < 3 < oo , <j > 0} . Since L (J3, a ) is maximum 

when 


- 1 . 


3=6= (X'X) X'x 


^ AQ 1 A A 

0=0 = i(x - X_B) • (x - Xji) , 


we get 


N 


max L (_B, a ) = i 

ft V 2tt (x-X 3 ) ' (x-x3) 




exp {- %N} . 


(3.66) 


Under the null hypothesis H q , r reduces to the one dimensional 


space 


w = { (3°, o 2 ) : a 2 > 0} 
and 


max L (3,o^) = 
u> — 


N 


hN 


2tt (x-X3°) ' (x-X3°) y 


exp {-i^N} . 


(3.67) 


Thus the likelihood ratio X is given by 

max „ ,. 2 . . * * 

, a) MB,o ) f (x-XB) • (x-XB.) \ HN 


UNIVARIATE GENERAL LINEAR HYPOTHESIS 123 


BSPS? - 

jpife 

, (B - 6°) 1 X'X(B - 8°) 

. (X - XB°) • X(X'X) -1 X' (x - XB°) , 

A A 

= (X - XI) ' (X - XB) = x' (I - X(X'X)’ 1 X»)x 
q 2 - - 

= (X - Xj3°) • (I - X(X'X) _1 X') (X - X£°) . 

Since X is a monotonically decreasing function of q^/q 2 • ^he^ 
likelihood ratio test rejects H q whenever 

S s C (3.69) 

q 2 

where the constant C depends on the size a of the test. 

The test statistic q../q can also be obtained by the method 

J- ^ A A o 

Of least squares. Let us first observe that q x = (3 - £ ) 'X'X(£-£ ) 

= min (x - XB) ' (x - X3) - min (x - X3) ' (x - x$) (3.70) 

id ” n — 

In the terminology of least squares 

min (x - XJ) ' (x - x£) and min (x - XB) ' (x - XB) 
w fl “ ~ “ 

are called the sum of squares due to the null hypothesis (or the 

total sum of squares) and the error sum of squares, respectively 

and q^ is called the sum of squares due to the deviation from the 

null hypothesis or the sum of squares due to £. For testing H q 

we take the ratio of the sura of squares due to deviation from H 

o 

to the error sum of squares as the appropriate test statistic. 

Theorem 3.15 . Let Z = X - x£°, Q ± = Z ' X (X ' X) " 1 X• Z and Q 2 = 

— (T-X(X'X) X')£. Then Q^/o 2 and Q^/o 2 are independently 

distributed as chisquares with p and N-p degrees of freedom, 
respectively, when H q is true. 

Proof. Since under H q , Z has an N-variate normal distribu¬ 
tion with mean £ and covariance matrix a 2 I, and X(X'X) - ^X', 

1 ~ X(X'X)” ^X 1 are idempotent matrices of rank p and N-p 
respectively, by Theorem 2.7, Q^o 2 and Q 2 /a 2 are distributed 
(under H q ) as chisquares with p and (N-p) degrees of freedom, 
respectively. Furthermore, since 

(I ~X(X'x)' 1 x«) X(X , X)~ 1 X‘ = 0 , 
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we conclude from Theorem 2 . 10 that under H,. QyQ 2 are 
independent. Q.E. d . 

H N=£ has the i-’-distribution with p and 

Thus,, under H„. -f Q is gl 

H .p degrees of freedom. Henge the c ±s usua J 

r _ F . The computation of the ues> y 

1-a, p,N-p 7 0 

carried out in the form given in TaOie 

Table 3.2. ANOVA for testing H q : 6 . = 6 . __ 

Sources of variation Degrees of freedom Sum of squares F-ualue 


Due to 6 


N-p 

q q-, 


Error 


Total 


(x-X6°) • (x-X3°) 


Case (b) . Let X be partitioned as X — (X^#X2) where X^ is 
the N x q submatrix of X containing the first q columns of X and 
X2 contains the remaining p-q columns of X and let 

-(1) = (8].* *' £( 2 ) = (3 q +l' * * *' ’ ‘ 

Then 

X£ = X l£(l). + ^2—( 2 ) • 

We are now interested in testing the null hypothesis 
H o : -( 1 ) = -( 1 ) = , 

where £ (2) , o 2 are unknown. The parametric space n is the same 
as that of case (a) . Under H q , it reduces to 

“ = t(6 °l)' S ( 2 )'° 2): - " < i - q+l,...,p, a 2 > 0}. 

The maximum of the likelihood function Ml.a 2 ) of ( 3 . 65) under „ 

is given in (3.6#) . To find the maximum of L( 6 .a 2 ) unde r „ we 
proceed as follows: 

. - 1) 1 (x - XB) 















:~/g 


' 2o 2 = 

o 2 

where u = x - X ]£.(i) and J3^ 2 -)'° a , re unknown parameters 

Comparing (3.71) with (3.65), it is easy to conclude that 

naximum likelihood estimates of 0 and a 2 under H ar 

o 

1 ( 2 ) = (X 2 X 2 > x 2 — 

“ 2 * w ^ ' <2 - x 2 i (2) ) • 

It may be noted that X 2 X 2 ^eing a principal minor of 
positive definite matrix X'X, is positive definite and he 
a unique inverse (X'X )" 1 . Thus from (3.71) and (3.72) 


max L(6,a ) = -*-—-V {-kN} 

V"‘H-x 2 B (2) ). (u-x 2 . 6(2) J eXp{ - N> 


( - " X 2 -( 2 ) } ' ( H- X 2§(2) )I = JJ' & ~ x 2 ( X 2 X 2 } lx y u 

= fx_y ft° ) I T_ Y fY> Y t ir V C ° \ 


“ L( £' a ) \(u- X 2 £( 2 ) ), (u- X 2 £ (2) y 

(1 + 


= (X - X0) • (X - xh 

= *' (I-XtX'X)"^')* 

“ { *“ X 1^1)- X 2^(2) ) ' U-XU'X)"^') (X-X.0 
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qi = (u-x 2 B (2) )’ (u-X 2 6 (2) ) - (x-X£) • (x - XI) 

- 1 , 


J^XJu - x' (I-X(X'X)" X')x 


= U' (I-X 2 (X^X 2 / - 2 

= (S-X 1 B° 1 ) -M(«> ,IX(X 'X>" 1 *' 

” X 1- < (1) ” X 2- (2) ^ * 


- x 2 (X 2 X 2 ) " lx 2 ) 


The computation in (3.75) follows from the fact that 


X* (I-X(X'X) 
implies 

X| (I-X(X'X) 


-1 


X') = 0 


-1 


X') = T> # 


X^d-X(X'X) 


-1 


X') = o 


(3.75) 


(3.76) 


Since X is a monotonically decreasing function of t ^ ie 

likelihood ratio test rejects H q whenever 

> C , (3.77) 


where the constant C depends on the size a of the test. 

As in case (a) the likelihood ratio test statistic q^/q^ 
in (3.77) can also be obtained by the method of least squares. 
Obviously q 1 + q„ is the minimum value of (x-XJ 3) ' (x-X_3) when 
the null hypothesis H q : = JL(d holds good and hence it 

represents the sum of squares due to H q . Since q 2 represents 
the error sum of squares, in our present notation q^ will 
represent the sum of squares due to the deviation from the null 
hypothesis H q or the sum of squares due to adjusted for 


-(2) # 


If 6 


-U) " - 


= 0 then 


*1 =2'* * 1(2, X-JjX- X’X + 1-X'x 


= B'X'X - £ ( ' 2 ) x^x . 


(3.78) 


The quantity 3_'X'x will be called the reduction sum of 
squares duetto £ /or simply the sum of squares due to £ . 
Similarly H( 2 ) X 2— be called the sum of squares due 

t0 -(2) ignoring . 
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S!SS re S L^M- Let Z = X - , 

a 2 = £' A 2. and Q x = Z'BZ where A = i_ x(x . x) -l x , 

/Ul v\ ” ^Vl V /v>u X — 1 


, B - X(X-X)- x- - X 2 (X i X 2 )- 1 X . . 


Then Q±/ a 


2 re independently distributed as ctiisquares 


with q and N-p degrees of freedom, respectively. 

Proof.. Since under H q ,z_ has an N-variate normal distribution 
with mean 0 and covariance matrix c 2 i, and B,A are idempotent 
matrices^of rank q, N-p respectively, by Theorem 2.7 Q^o 2 
and Q 2 /a are distributed under H q as chisquares with q and N-p 
degrees of freedom respectively. Furthermore, since BA=0, we 
conclude from Theorem 2.10 that under H q , Q^,Q 2 are independent. 

Thus, under H q , -=£. has an F-distribution with q and N-p 

degrees of freedom. Hence the constant C in (3.77) is given by 

c = F l-a,q,N-p* For P ract ical applications = 0 is quite 

useful. We shall give below the ANOVA table for H q : $^^ =0. 

Table 3.3. ANOVA for testing H ; $..,* = 0 

__ o — ( 1) — 

Sources of variation Degrees of freedom Sum of squares F-values 


Due to £ 

Due to B (2) 
(unadj us ted) 

Due to 
(adjusted) 


£'X'x 
1(2) X^£ 


N-P q x 

q q7 


For any estimable linear parametric function L'£, by 
^eorem (3.14), under H :L ! (3 =6, 
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« the Student against alternatives 

Hence an appropriate tes 

L'0 *6 rejects H Q whenever (3.80) 

:| 2 ^.--P ‘ appropriate test procedure is 

case (d) . For testing * Q .± “ 

to reject H q whenever (see 

(N-r) (1-6) (3 * 8l) 

—_— > C . _ i. K 7 _r“ 


- *2 

k K 


l-a,k, N-r 


„5. ONE-WAY CLASSIFICATIONS WITH LINEAR MODELS OF FULL RANK 

„ * several univariate normal 

It refers to the comparison of mean 2 T t X. 

mnulations with the same unknown variance a • ij' 

populations witn rne from the ith normal 

1 = 1 ...N, be a Random sample of size ^ 

j - x, . ../"j c 2 . = 1 .2..../P# and let 

population with mean 8 i and variance -# 

the random samples from different populations be independent of 
each other. Write 
P 

N = £ N. , 

i=l 1 

X = (X 11# ...,X 1N ^ # ...» x p i'*** /X pN ^ ’ ' 
x = (x u , ...,x pl , ...,x pN ) • , 

3 = (3,, ...,3j • . 

— .1 p 

The N-vector X has a multivariate normal distribution with 

E(x) = x3 , Cov(X) = o 2 I, (3.82 

where the N x p matrix X has the following form: the first row 

are all = (1,0,...,0), the next rows are all e 1 = (0,1,0, 
e..,0),..., and the last N rows are all e 1 = (o 0 1) 

Obviously X has rank p andhence (3.82) represents a fixed effect 

linear model of full rank. 

We are interested here in testing the null hypothesis 
v 6! = ... = B p = B(say), where B, o 2 are unknown> 







<3. 


mfmm 


8 0 ) 


m 


(3 D 

, 8 l) 

^NK 

n ° r ^l 


The parametric 

„ r O S P®C© o . - «tN| 

= {(6,0 2 ). 2 is t v ■'° 1 ' Liim E . d ■««,«, 

~ '• —<(l< e <P+1) Hvp 0TH€ Sls ill 

Under H q lt r • i « 1( W 

» = <(e,o 2 ). S t0 the 2-din."’ 1 ” 02 " 0 } Mce9ive »b, 

S < «o rr2 l0 »lal 

Let E(x. ) '° > 0 } P9ce 

hypothesis H^i«. . U *j * w ith t h 

o ls lin^.. , ttle 



gi ven by y 
,4 _ . iJ 


ij u, .. 

H o u li^r h Wlth the nota 
, • ij = B j . hy P°thes is J ; 09 « tx*^. 

line on „ hich J ' l.„ l «> s-r , x 3 - 2 the 

the N coor,. 1 1. ... r> p) ' with t, 

,. dlB * t « o £ „ , p ;, and “ith , u the 9 


M pl # » U ) I „ 

PN P are equal. 


a = ( W 

M ir •••* 


mal 
let 
t of 


N. 


. D N - 

Let x = 1 £ i 

N ■ 1 * V 1 1 

1=1 j=i o' i. = r i x 

n N. . i j = i iJ * Slnce 


I I (x o \ 2 P i 
1=1 J = 1 lX ij- B i ) - Z E 


E (X.,- x )2 

1 = 1 j = i ij x iJ + 



iM 8 l (X i.-.» 2 . 

ij = X i. of V when ir fi holds. 


_ N. 
P l 


IJ 


. Z , (X il" X )2 + N(x -B ) 2 . 
1 = 1 j = 1 ■ L J 


i. . under tt is given by u. .=x 
ij <d ’ 1 


(3.821 

rows 

1 

0 , 

feet 


Thus from (3.7), the test statistic w is given by 


P i 2 

I N.(x. -x ) 

N . .1 x • • • 

=£. iZk 


P_i F ? i ( x -x ) 2 
i=l J-l 1J 1- 


(3.83) 


and the distribution of « depends only on the parenete 


r 2 _ 

r f' - 


£ a. 1 

i=l 1 


-e ) 2 . . ? ■.(./«. «"* the unlforrty 

* -- , where 3 . , x 1 

2 * 1 = 1 
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_,^, 4 . the alternatives 

most powerful invariant test of H Q , a 9 

r . _T_^ 4 - Vi q constant c 


0 , rejects 


l MV- www -- Q ’ 

> whsr6 tli© constant c 
whenever w * c, wnere 


grrs — 

• ° * tests we Shall show later that 

depends on the size a of the tests. 

c = F 


l-a, p _i,N-p* 

•we now treat the problem as the regression model and find 
the likelihood ratio rest statistic for this probl 


The likelihood function is given by 



N. 


L(B,a 2 ) = ( 2 ff) ”^ N (cj 2 ) ”^ N exp {- F ‘ E (x ij _3 i 

” 2o i = l 1 = 1 


) 2 ) 


It cari be easily verified that 


max L (£, a ) =| 

03 


N 


J^N 


P 

k 2ir ( Z 


N, 2 

I 1 (x -x ) Z 

i=l J = 1 J 


exp {-^N} # 


• • 




max L(8,u ) 

/s — 

P 


N 


P N. 

2 ir ( Z Z 1 (x .-x. ) 

i=l j = l 1J ‘ 


exp {-^N} 


Since 


N. 

P i 


N. 

P 1 


Z Z ix. ,-x ) = Z Z (x. .-x. ) + Z N (x -x ) A , 


l l v x. ,-x . 

1=1 j = i ** i-1 j=l 1J 1 * i=l ii* 


the likelihodd ratio A for testing H can be written as 

o 


A = 


max L(j5,0 ; 

0 ) 


max L(8, a ) 

n 


(i + i r^ N 

q 2 


(3.84) 


where 


P 


q, = I N.(x. -x ) 

i=l 1 1 ■ 




~ N- 

P i 

Z E (X ii‘ X i ) 
i=l j = l lj x * 


9 
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, the terminology of least squares 

uares due to the deviation from the ll S “" ° £ 

V-ne null hwnntv,__ 


^.. s .;:r,r\r — 

q l + q 2 represents the sum n-f 
fhpsis H or the 4 - um of squares due ,to the null 

tj hypothesis H o or the total sum of squares. From (3 84) the 
f likelihood ratio test rejects H whenever 


N-P Qi 
q 2 


(3.85) 


( where, the constant c depends on the size a of the test. This test 
has also been obtained in (3.83) by using invariance consideration, 
this test can also be derived from the least squares theory by 

taking the ratio of [min(x-XJ) • (x-xj) -min (x-XB) • (x-XB)l to 

W Q — ~ — — 

: min (x-XB) 1 (x-X_6) . 

i p p 

I Theorem 3. 17 . Let Q 1 = £ N. (X. 1 -X ) 2 , Q 0 = E S 2 


N. 

ft X 1 


(X..-X. ) 
1J 1. 


= E Ex. 

j=l XJ x * •* N i=l j=l iJ “i- " N“i ^ A ij. 

2 2 

' Q 2 / '° are ind eP en dently distributed as chisquare 

§ with P-1/ N-p degrees of freedom, respectively, when i is true 

° 

I Proof. Obviously for each i, X. , S . 2 are independent, 

H n 2 / 2 1 • 1 

in /o is distributed as chisquare with 1SL-1 degrees of freedom 

land v^rx . is normally distributed with mean v^T. 6 and variance 
jl 2 i 

when H is true. Since random samples from different 

I o 

■populations are independent of each other we conclude that Q 2 /a 2 
ls distributed as chisquare with N-p degrees of freedom and is 
^independent of . Furthermore under H q it is easy to see that 
I ®l/ a is distributed as chisquare with p -1 degrees of freedom. 

q Q.E.D. 

Thus, under H , —- has the F-distribution with p-1,N-p 

\ ° p- 1 q 2 

° f free dom. Hence the constant c in (3.83) and (3.85) is 
Jpnsi F l-a,p-l,N-l * 


p 

Q ? = E 
•i=l 


L_ £ 

J. . . 
i J=1 


I 
i 


• i 


-,., .*w ! - ■ ' ■'***' ■ *■'■ ■ 7 '■' r '■■■-• ■ • 
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Table 3.4. ANOVA for One-way classification 


Sources of Degrees of Sum of squares Expected mean 
variation freedom surr. of squares 


Between 

populations 


£ (3.-B) 
P- 1 i= i 1 


Within 

populations 


Total 


N. 

Pi 2 

Z S (x. .-x ) 


If the F-value is greater than F i_ a , o-l,N-p' we say t “ at it 
is significant at level a and consequently we reject the null 


hypothesis =...=3 p . ^aturally such rejection leads to 

further investigation to decide which means are different. Since 


X. - X. 

1 . JC . 

/' k * k ’ ’ 

where S 2 = Q 2 /(N-p), has the Student t-distribution with N-p 
degrees of freedom, the null hypothesis that 3^ = 3^. will be 
rejected at level a if the observed difference x.^ - x^ is 

greater than the critical difference 


/ 2 (- i- + JLv 

t l-a/2,JH-pJ S N k ' 


(3.86 


where s 2 = q 2 /(N-p)• Thus we may compute all possible critical 
differences arising out of all possible pairs i / k = 

1,...,p, and compare them with their corresponding observed 
differences. On the basis of such comparisons it will be 
possible to divide the p populations into different groups such 
that the populations in the same group have the same mean and 
those in different groups have different means. The computation 
is simplified if all n are equal to n (say). In subh a case we 
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f coopute the common critical di££ erence 

E 1 ,- 

I . 

^ t^ccd- t i-a/2,N-tV n (3.37) 

2 l Pand compare it with the observed differences x. -x , for all i * k 

2 if any difference exceeds the ccd 4 -*,^ a.*. i. k. ■ 

11 \ ^ 6 CCd ' then corresponding means 

^lare said to come from populations with different means. 

When comparing a nunber of population means one would thus 

typically wish to decide not only whether the means ate equal, but 

in case the hypothesis of equality of means is rejected, would like 

to rank or divide them into subgroups such that the populations 

. with the same means belong to the same subgroup and with different 

at it . *e ans to different subgroups. This we have achieved by the 

! I combined use of F and t-tests. Other types of procedures for this 

° % haVe been P r °P°sed by, among others, Student-Newman-Keuls, the 

Since range test", Duncan (19 55) "the multiple range and F test"; Tukey 

(1951, 1953), "a test based on allowances" and Scheffe (1953), 

"S and T methods for judging all contrasts in the analysis of 

variance". These procedures are commonly called multiple 
\ coY-fcx^^on 


procedures. We shall discuss only St udent-Newman-Keuls 
3nd Duncan ' s multiple range test. For a detailed discussion 
'•m/* multiple comparison procedures and their relative merits and 

demerits the reader is referred to Federer (19 55) . Though we will 
discuss these two procedures for comparison of means in the one-way 

I classification only, it can be applied to similar other problems 
without any difficulty. Let the observed means x ,...,x be 

w ra 1 ^ • p • 

ranked from the highest to the smallest as x ,...,x , i.e., 
i» .. P r 


i C3 1 


X 1 = the smallest of x 


P- 

i.e.. 


Ui. /v. < f • • < /v /••<■/ ■» 

1 . p. 

= the largest of x.^ , 


;...; x. = the ith largest of 


quantity x - x, is called the range 
p i 


q (x d " x l } ^ 

u cfr P* N-p = — 2 --- 

s 


(3.88) 


d r Wh ere 


N = .. = N = n (by assumption) is called the Studentized 

1 * P 


e 
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. X 

rangG ° f ^^erclnt point o£ the S 
the upper a P are given 

__ a.. _ v 


x .. " Grange' » ere v = N -S) 

observed — 2i Stude'nti^d »n. 


denote 


K? 


The value* of a o,P« v 


( igra-t-^- 1 " 30 — (19 ^ 


„anJ<£uls_T£2Jb^^ critical 

- (1212Uj we compute the of P «**r given a , v) 

SSHiS-ii^ 22 -^ given by ( aS 

Studerstized range p , U. 89 ) 

2 



W p = q a*PA v n x with w . — 

NOW we compare the observed range p i and we P c onclude that the 

than W p . the process of »«*•£■“ exceed s W , we divide the 

p -“ *? * ranke d mSans each, x p to x 


P ft are equal 

means $w»**'P P n f 

1 4„4. 0 two subgroups or p 

ranked means in e the ranges 

andx .tox.. Then we compare . _ 


If it is less 


-e, 


X ,-X. 
p^l 1 


. .0 x.. Then we comp— t ne Jans in the 

with W p J. If either range is less than p-1^ ^ ee<js w _ ,, t] 


With Vr “ either ra "^r Tf either SSg. exceeds W p _ r then 
corresponding ^oup^ar^eg are again divided into two 

the P-1 -“J * each and the ranges are compared with W p _ r 

groups of p-2 means is found whose range 

This is continued until a group of 1 means MgWKS* 

does not exceed . In other words, by this method the 

between any two means is significant provided the range of the^ 

observed means of each and every subgroup containing the given 

means is significant according to Studentized critical rang*? 


10 
1 i 
14 


3,5,2. Duncan's Multiple Range Test ; This test procedure is 
essentially the same as that of Student-Newman-Keuls except that 
here the observed ranges are compared with Duncan's a-percent 


critical range 


D P 


‘a. 


P/v 



(3.90) 


Where q “,p, V denotes the upper a-percent point of the Studentized 

range based on Duncan's range in Table 3.5, instead of w . ?S!gSi 

feels chat this test is better thari => P 

for comparing d,« * 7 prev:Lousl y proposed test 

r comparing differences between = «,. a. 

between any two ranked 


means 
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Table 3.5 
* 


Values of q 


a ,p,v 


a = .05 


10 

3.15 

3.29 

3.37 

3.43 

3.46 

3.47 

3.47 

3.47 

3.48 

3.48 

3.48 

12 

3.08 

3.23 

3.33 

3.36 

3.44 

3.46 

3.46 

3.46 

3.48 

3.48 

3.48 

14 

3.03 

3.18 

3.27 

3.33 

3.37 

3.41 

3.44 

3.36 

3.47 

3.47 

3.47 

16 

3.00 

3.15 

3.23 

3.30 

,3.34 

3.39 

3.43 

3.45 

3.47 

3.47 

3.47 

18 

2.97 

3.12 

3.21 

3.27 

3.32 

3.37 

3.41 

3.45 

3.47 

3.47 

3.47 

20 

2.95 

3.10 

3.18 

3.25 

3.30 

3.36 

3.40 

3.44 

3.47 

'3.47 

3.47 

24 

2.92 

3.07 

3.15 

3.22 

3.28 

3.34 

3.38 

3.44 

3.47 

3.47 

3.47 

30 

2.85 

3.04 

3.12 

3.20 

3.25 

3.32 

3.37 

3.43 

3.47 

3.47 

3.47 

60 

2.83 

2.98 

3.08 

3.14 

3.20 

3.28 

3.33 

3.40 

3.47 

3.48 

3.48 

100 

2.80 

2.95 

3.05 

3.12 

3.18 

3.26 

3.32 

3.40 

3.47 

3.53 

3.53 

ro 

2.77 

2.9? 

3.02 

3.09 

3.15 

3.23 

3.29 

3.38 

3.47 

3.61 

3.67 




a = . 

01 


a = .0 






10 

4.48 

4.73 

4.88 

4.96 

5.06 

5.20 

5.28 

5.42 

5.55 

5.55 

5.55 

12 

4.32 

4.55 

4.68 

4.76 

4.84 

4.96 

5.07 

5.17 

5.26 

5.26 

5.26 

14 

4.21 

4.42 

4.55 

4.63 

4.70 

4.83 

4.91 

5.00 

5.07 

5.07 

5.07 

16 

4.13 

4.34 

4.45 

4.54 

4.60 

4.72 

4.79 

4.88 

4.94 

4.94 

4.94 

18 

4.07 

4.27 

4.38 

4.46 

4.53 

4.64 

4.71 

4.79 

4.85 

4.85 

4.85 

20 

4.02 

4.22 

4.33 

4.40 

4.47 

4.58 

4.65 

4.73 

4.79 

4.79 

4.79 

24 

3.96 

4.14 

4.24 

4.33 

4.39 

4.49 

4.57 

4.64 

4.72 

4.74 

4.74 

30 

3.89 

4.06 

4.16 

4.25 

4.32 

4.41 

4.48 

4.58 

4.65 

4.72 

4.72 

60 

3.76 

3.92 

4.03 

4.12 

4.17 

4.27 

4.34 

4.44 

4.53 

4.66 

4.66 

100 

3.71 

3.86 

3.98 

4.06 

4.11 

4.21 

4.29 

4.38 

4.48 

4.64 

4.65 


3.64 

3.80 

3.90 

3.98 

4.04 

4.14 

4.20 

4.31 

4.41 

4.60 

4.68 


Reproduced with the permission of the editor of Biometrics 
fr ° m the paper by D.B. Duncan, Biometrics 11:1-42, 1955. 
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Example 3.5 . Consider the data in Table 3 1. The anal Ysis 
of variance table for this is given in Tab 


Table 3.7. AN OVA for the data in Table 3.1 
Sources of variation Degrees of freedom Sum of squares 


Between Varieties 
Within varieties 

Total 

gince 


18.6757 
298.784 3 

317 .4600 


1.08 34 


' 4.22 , 

if 

a = 

0.01 


3.40 , 

if 

a. = 

.025 


K> 

• 

00 

o 

% 

if 

a = 

.015 


2.205 , 

if 

a = 

.10 


■1.415 , 

if 

a = 

.25 



1-a, 3, 5, 2 


we accept the hypothesis of equality of means of different 
varieties at size a < .25. 

3.5.3. Tests of Equality of Variances ; Let ^, j = 1,...,N^, 

i = 1, ...,p be a random sample size N. from the ith population 

2 1 

with mean 8^ and variance and let 

H ; a, 2 = ... = ° 2 (3.9 

o l p 


H ; a = ... = a (3.91) 

J. 

One can obtain the likelihood ratio test or the Bartlett modifica¬ 


tion thereof for this problem. We shall treat here an alternative 
approach by which it can be reduced in large samples to one-Way 
classification with known variance. 

Unfortunately for this problem neither there exists a 
uniformly unbiased test nor there is available a group of 
transformations with respect to which there exists a uniformly 
most powerful invariant test (see Lehmann (19 59)) . For lange 

P 

N( V- however, this problem can be reduced to one-way 

classification. Let 
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k r n - ^ (X U - x i.> ' V- 


= 1, ...,p 


It is well known that (X. ,S 2 , i = l, ...,p) is a sufficient 

statistic for this problem. The original hypothesis 
jj . a. 2 = . • • = a ^ remains invariant under the group G of 
trans formations 

y X. . t C. , J — 1» • • «»N. ( i — 1/ > •«/ p 
*ij U 1 1 

where the C. are reals and which in the space of the sufficient 
statistic induces the transformations 


y X. + C . , S . 

x i. l • i i 


Si 2 - 


i l,«..,p. 


A set of maximal invariants under G in the space of the 

2 2 

sufficient statistic is (S 1 ). 

Since 2 is distributed as the sum of squares of KL-1 

independent normal random variables with means 0 and with the 

2 

same variance a , we can conclude from the central limit 
theorem that as KL -*■ 00 

, S. 2 2 

(N.-l)^ ( _ o 2 ) 

1 N.-l 1 

l 


is approximately distributed as a normal random variable with 

4 

mean 0 and variance 2a i . 

This result is not convenient for our purpose as both the 

mean and the variance of this limiting distribution depend on 
2 

a. . However, this difficulty can be avoided by considering 
• 1 2 

independent random variables Z. = log (S / (N -1) )» i=l,...,p 
, 2 ^ 1 # 1 

instead of S^. For large N^,the random variable is approximately 
normally distributed with mean 0 i =log and variance a 2 =2 (t^-l) . 

In terms of the Z ± , the problem of testing H q reduces to 

^ problem of testing the equality of means of p independent 

'fcrmai random variables , with known variances. 

i P 

In the special case where all are equal, the have the 
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same known variance a 2 (say) but different mekns 0^. The 
problem of testing the hypothesis 6^ = ••• e p remains invariant 
under the group of transformations 


Z . 
l 


z. + c, 
1 


i — 

where C is real and under the group of orthogonal transformations 

of hyperplanes which are orthogonal to Z^ = ... = Z . The 

uniformly most powerful invariant test under these groups rejects 

H whenever 
o 


P 

E 

i=l 


(z ± - z) 2 


> k 


(3.92) 


where z = — E 
P 

of the test. 


and the constant k depends on the size a 

.. = 8 , 


i = l 'i 

Obviously under the hypothesis = 


■T — 2 2 

E (Z.-ZV /a is distributed as chisquare with p-1 degrees of 
i=l 1 

2 

freedom. So k = x, , . 

1-a,p-1 

In the general case the problem reduces to a general linear 

hypothesis with known variance by considering Y = Z /a 

i i i' 

i — 1/ ...,p. The uniformly most powerful invariant test under the 
group of transformations leaving the problem invariant, rejects 


H 


9 i = 


= e whenever 
P 


u- ? 


i = l a. 

l 



P 

E 

i=l 


z. 

l 


2 (a/) 
i=l - 


2,-1 



^ k 


where the constant k depends on the 


under 


“o' 


U is distributed as x‘ 


size a of the tests. Since 


. r we get k = x; It 

nay be noted that we are only considering here the alternatives 









UNIVARIATE GENERAL LINEAR HYPOTHESIS 141 



> 0 

for the invariant tests. Under H q , 6 2 is equal to zero. 
3 . 6 . EXPERIMENTAL DESIGN MODELS 


In this section we shall consider linear models (3.9) where the 
design matrix X does not have full rank and the matrix X consists 
of O's and l's only. 

Let us consider once again the one-way classification. 

Although the linear model adopted earlier to analyse one-way 
classification is the most natural one for such experiments, the 
following alternative model resembles more closely the models used 
in experimental designs and at the same time it gives us an 
insight into the problem of design models and their analysis. Let 


E(X ) = y+ a.., 

1 *3.94) 

V( V ■ ° 2 - J ■ 1 . N i- 1 = 1 . p< 

where X . are independently normally distributed. Clearly (3.94) 

1 J _ l P „ 

can be obtained from (3.9) by writing y = 3 = - l 6 ., a = 3 - 3. 

p ^ i X' 1 

In matrix notation (3.94) can be written as 
S(X) = X* 3* , Cov(X) = o 2 I 


where = (y, a , a )• and the design matrix X* of 

dimension N x (p+1) can be obtained from the design matrix of the 

model (3.9) by adjoining a column of l's on the left of X. 
Ckviousiy the first column of X* is the vector sum of all its 

re maining p columns and hence the rank of X* is p. It is thus 

a Pparent that all linear parametric functions of a i'***' a p are 
estimable. What kind of linear parametric functions are then 
^timable ? For any linear estimator 
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_ N - 

P i 

E £ a. x. . , 
1=1 j = l lJ 

we get 


with c. = £ a. . / 

1 j = l 


Pi. P i / ^ x 

E( Z Z a. , X. ,) = £ 2 a . (y + a ± ) 

i=l j = l i=l j = l J 


P , ^ X 

= £ c. a. + y ( £ c .j' • 

i=l i=l 

Thus estimable linear parametric functions of are of the 


P P 

form Z c. a. + y ( Z c.). 
i=l 11 i=l 1 

Furthermore we irey also conclude that any linear parametric 
P P 

function £ c.a. is estimable if and'only if £ c. = 0, i.e. 

i=l 11 i=l 1 

P P 

Z c.a. is a contrast. Thus neither Z a. nor any of the 
i=i 11 i=i 1 

parameters y,a^, ...,a p is estimable. 

The least squares estimates y* a^,..., a p of y, a^»...,a 
are given by the normal equations 
P _ P 

trt + Z N.S. = I N. Xi- , 

i ~ 1 1 1 (3.9! 

jj + ot-j = x. / i I# • • • / p • 

Obviously the first equation in (3.95) can be obtained from the 

second one. Hence they are linearly dependent and consequently 

y, cL,..., a do not have a unique solution (the same state of 

affair exists for the maximum likelihood estimates). However if 

P 

we impose the side condition that E a. = 0, then the normal 

1 1 

equations (3.91) have a unique solution and they are y = x , 

■ • 

= x^ - x . But we can without any loss of generality 

assume that £ a. = 0. For, 

1 1 

y., = E (X. ,) = y+ a, = 


(y+a )+ (a, -a ) 
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*»*• p ' “1 and l a* = 0. 


Since 


M * + “i = a i 1 a »a l a* = o 

1 1 

|j* = y .l' i y i! y .l are un iquely determined by y 
^^y such a un lq ue representation of y , 0 . s in of 

is not possible. 1 


Finally, if any linear parametric function a 0 „* + ? = 

v .l + I a i (y il' u .l ) is estimable, then a^^ + ! a * (x^-x x > 

its unbiased estimate and is obtained by replacing V, by 

their unique least squares estimates. N 1 

Thus in the case of the linear model (3.95) where the design 
matrix is not of full rank and hence the parameters do not have 

p 

unique estimates, we can, with an added restriction Z a. = 0 

•j t 

obtain their unique estimates . 1 


The model 


X. .) = M* +a* , Z af = 0 

lj l x l 


(3.96) 


is called a reparametrization of the original model given in (3.9 4) . 

To analyse a linear model which is not of full rank, we will 
reduce it to a full rank model by means of reparametrization and 
then use the results obtained in Sections 3.2-3.4. 

By a reparametrization of the linear model E (X) = we 
obviously mean a transformation of £ to ct = A$_ where A is a 

of real elements such that each element of a is an estimable 
Action, we now establish the existence of such a reparametriza- 

tion. 

1 3 ‘ 6,i * Existence of Renarametriz ed Models: Let us now consider 
^ «*.! E (x) = XB. Cov(X) = x is an N x p matrrx 


^nk 


P, 6= ( Bl .V and the N-vector X has a 
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multivariate normal distribution. Since X'X is a positive semi., 
definite matrix of rank k, there exists a nonsingular matrix c of 
dimension p x p such that 


C'X'XC = 


0 0 


( 3 . 97 ] 


where D is a k x k matrix of rank k. Writing C w her ( 

C 2 is a p x k matrix containing the first k columns of C, from 

__ _ ... _ a c-: n 


(3.9 5) we conclude that C^X'XC^ = D, C^X'XC^ - 2.* Since D i s 
of rank k, the rank of C^X^ is k and C^X* = £_. Thus, 

E(X) = XCC -1 |3 = (XC 1 ) U x e , (3.< 


(3.98 


where C~ = (U 1 ,U 2 ) 1 , and U ± 
first k rows of C -1 . Now wr 


is a k x p matrix containing the 


Now writing 


a = U.6. , 


XC 1 = Z, we get 


E(X) = Zct , Cov(X) = a 2 1 , 


(3.99 


where Z is an N x k matrix of full rank and a_ is a k-vector each 
of whose components is estimable. Note that the unique least 
squares estimate (which is also the maximum likelihood estimate) 
of a is given by 

£ = (Z <Z) ~ 1 Z 'x . 

There may be other full rank reparametrizations of the same 
non-full rank model. The following argument shows that any two 
full rank reparametrizations are linearly related and hence the 
estimates of any estimable linear parametric functions from any 
two full rank reparametrization models are identical. Suppose 
E(X) = x£ , where the rank of X = k < p, has two full rank 
reparametrizations E (X) = Za and E (X) = W X , where Z,W are two 
different N x k matrices of rank k'. Since they are 

reparametrization models there exist matrices A,B of dimension 

k x p each such that a = AB, y = BP 

— —* JL since the vectors a, X 

are of the sere dimension, there exists a nonsinguiar matrix's 
such that A = EB. In other words, any two full rank 


VARIATE GENERAL LINEAR HYPOTHESIS 145 

arametrizations are linearly related. 

irvamole 3.6. Consider Hi*._ 

^ tne one -way classification with p = .2 

and N i ' N 2 2 ' Wlth the linear model 

13-9,1 '> elx ij > = M + “i ' J = 1.2, i = 1,2. 

ft* h mV + v * = O'-VV - •« - (6,,6 2 )-. Then 


a = ujjl , where 


(3.98) | 


the 


1 , 1,0 


1,0, 1 


Since C = (U 1# U 2 ) • is nonsingular, we get 


(3.99) 


ach As 

K 


/ 1 , 1,0 
c' 1 = 1 , 0,1 

V 0,1,1 


% = 


we get. 


1 , 1 , 0 

1 , 0 , 0 

1 , 0 , 1 

1 , 0 , 1 


ht Hi -H 
Hi -H> H 
-Hi Hp H 


Hi -H 


2 = xc : = 


normal equations for the reparametrized model is given by 
I <2*Z) a = z'x . From this ^ = * 1# , *2 = X 2. * N ° W “l " & 2 1S 
t estimable and is equal to S 1 - 3 r * Thus B 1 “ B 2 = x l. ' X 2. * 

On reparametrizing a linear model to the full rank mode 
« Za, if the full rank design matrix Z is such that Z Z is a 
la 9otiai”natrix then it is called an orthogonal reparametrization 


orthogonal r 
mcorrelated. 

ization alway 
e case of ful 
be a k x k or 
1 netrix whos 
_c v•v Sinc 


CO 

Vi 

ch 

ii 

al 

s< 


14 it follows t 
X) = o 2 I, where 
ed estimator of 

/\ /v 

X - Za) ' (X - Za) . 

a) has a k-^ariate normal' distribution with mean 

matrix a 2 <Z'Z)" 1 , '(N-l) a 2 /a 2 is distributed^: 

*ith N-k degrees of freedom, independently of a. 

* 

a in terms of B as 
[ - xB) ' (X - x£) 

a solution of the normal equation 
X . 

J 

have seen in Section 3.2 that for any estimable 
function L'8, the unique best linear unbiased e 
L'B,, where £ is any solution of (3.10 2) . Simi 

A a A 

e that a is unique for any such solution 8, 

sting the null hypothesis HqjL/B. = <5 where L'8. 
arametric function, we reparametrize the lineal 
to the full rank mode E (X) =Za such that a^L'8. 












!a in 


jider V - ■(« 1 -4)/^: il o 2 , where (2 , -i _ 

^distribution with N-k degrees of freedom iJ 

testing H„ against alternatives L'B t 6 i ' “ 0ptlraUm test 

— —* ^ 0 ls given by 

' i>»eject H 0 whenever |t|> t 

^ i-a/2, N-k * ' ■, ; ( 

' Let us now consider the nmKi 

„ Plem o£ besting the null hyp 

V 1 5 ^ k in the linear model E (X) = x 

where the rank of x is k. - 


D efinitio n , 3.2 . Estimable Hypothesis: A hypothesis H is 
called estimable if there exists linearly estimable function 
t'l£. *4 1 such bhat.H 0 is true if ar.d only if l* .8 = 

I i : li i 

f in testing hypotheses in linear models, where the desig 
matrix is not of full rank, we need to restrict ourselves to 
MX) | hypothesis which are estimable in the above sense. Let us as 

ion, H 0 1S estlinab le and H Q is true if and only if the linearly 

independent parametric functions a = L' ft = 0 i = i 

— 2.*- 9 x • ••*©• 
now reparametrize it to the full rank model E (x) = Za = Z a 

■ 10 WV(2 ) where - (1) = <“i.V'. a (2) - to s+1 .a,) 1 "' 

ind ~ ^ Z 1 #Z 2^' Z l bein 9 the N x s submatrix of Z containing i 

jp, first s columns. Thus the problem reduces to case (b) of 

. can Secti on 3.4 with h q : = £_. Hence the test statistics 

equivalent to rejecting H Q if 

’ N-k > 

101)1 s q 2 ~ F l-a,s,N-k' 


where 


^ « A 

= (x - Za) ’ (x - Za) = (x - XJ3) 1 (x - x§) 


q l + q 2 = (x -Z 2 a (2) ) • (x - Z 2 a (2) ) , 


with z ■ Z a 
- 2 2 -( 2 ) 


■ ^2— 


lt ° { Xt is easy to see that like Q 2 /q l + q 2 Can also be obtain e 
th e original model E (X) i= x£ . Write ^ = ... = 6 q in E (X). 
dnd Ca H it E (x) = Z^a,^ . Obtain a solution of the normal 

| __ «■» ^ y v f ^ J ^ A 

r> Uations Ziz,a,M = Zi x and compute (x - -Z o“ f 
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3.7. TWO-WAY CLASSIFICATION 

, • cation Ui th-"* TnteractiQn: Consider a * 

3.7.1. Two-way C las si-j ute are grown on I 

experiment where I each of J different locat ions . 

different plots of e s chemical fertili Zer 

All plots are equally treated wxth the same 

and water treatment, such an (A) and varie ty of jute( B) , 

experiment, the factors 9 t levels of A and diff er 

Different locations constitute differ observatlons 

varieties constitute different levels ol a. 

(yield per plot expressed in some convenient unit) can be 
arranged in a two-way classified (I * J) table where the rows 
correspond to the levels of A and the columns correspond to the 
levels of B. The (i,j)th element x^ represents the yield of ith 
variety of jute at the jth location. The corresponding random 


vaiictjr wj. j uw -- -i- 

variables x. . are assumed to be independently normally distributed 
with mean and variance a 2 . Throughout this section the 


replacement of a subscript in a variable by a dot will indicate 
that the variable has been averaged with respect to that 
subscript. For example 


l. 


= — Z x. , 

J j=i 




v ij' 


etc. 


i 




Since there is only one observation per cell, the combined 
affect of the two factors A,B, which is generally called the 
'interaction" of A with B, cannot be measured and we shall assume ] 
Lt to be zero. Mathematically, the function f(a,b) of two 
variables a, b, where a represents the level of the factor A and 
s represents the level of the factor B, is called a function of 
interaction if and only if there exists functions g(a) and h(b) \ 
such that f(a,b)=g(a) + h(b) . Otherwise we say that the factors 
Interact. Obviously for a function f(a,b) of no interaction 
:(aj,b/ - f (a 2 #b) is independent of b. Such no interaction 
functions are also called additive functions. 

A suitable model for two-way classification without 
nteraction is given by 
(X. .) = a', + s . 


i = 1. 


aA/1 


•., I: i = 1 


(3. 
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wfc« r ® °'i denotes the mean effect of the ith i , * 

and 6’, denotes th ® effect of the ,!* ° £ the £aCtor ' 

<..»<> 

, , I o, „ aei ln the parameters 

P nf J The design matrix X is given by 

Ithe coefficients of these parameters. 


Table 3.8 


a a 
1 2 


The design matrix x 



jk ... 

ff . . . . . - • « - 

V, It is a matrix of dimension N x (I + J ), (N = IJ) # end is of rank 
I + J-l (N > I + J) • Obviously all linear parametric functions 

I are not estimable. However, we can rewrite (3.104) as 

E(X. .) = y + a. + 6 , i__- 1/ •••»!» J = l » (3.105) 


1 a. = o, l = 0 , 

^ere ... # *!.. and u are uniquely determined by P . ^ ^ 

i' 6 y “i “'i -0 '. ' 6 J' 6 i • ‘ J‘ la# l c 6 and test the 

in this model is to estimate contrasts j i i x J J 
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H . *£ = 8 1 = ••• = ® J = ° ' 

£ 'tity c is called the "effect” of the ith level of the 
The quan y i b e the excess of the mean of the i th 

factor A and is defined t _ 

level of the factor A over the "general seen V = h. S imllarl 

g is called the effect of the jth level of the factor B. Wrlt ln , 

v i} = a + H * Bj + Ty. where Y ij = lM ij " W i-* “-j + U -. ) - 

the quantity Y is called the interaction of the ith level of 
A with the jth level of B. The linear model ElX^) - is a 

full rank model in the parameters MiJ -s and hence all linear 
parametric functions of y ij 's are estimable. Thus in particular 
all a. . all 6 and all YiJ are estimable (the least squares 
estimate of y^. is ■ But if we have one observation per 

cell and y. / 0, there is no estimate of error variance a . 

_ 1 
I J 

Since Z a = 0, Z 3. = 0, in the linear model (3.105) all 
1 1 1 J 

linear parametric functions of the parameters y, a^'s, are 

estimable. Furthermore since (3.105) is a reparametrized model of 
(3.10 4) , the model (3.105) will lead to the same estimate of any 
estimable linear function as in the case of the oriqinal model. 

AS in the case of one-way classification the only linear combina¬ 
tions of ot^'s or of $j's that are estimable are contrasts. 

The least squares estimates y, oL, B^ (which are also the 
maximum likelihood estimates) of the parameters y, ot^, are 
obtained by minimizing 

2 1 J 2 

s = E Z (x. . - y - ct _ 6 ) 2 

i=l j = l J 

with respect to these parameters. This gives 
I J 

Z I (x. . - y - a - B.) = 0 , 
i=l j = l ^ 1 J 

I 

Z (x - y _ a - B.) = 0 
i=l 1 J 


^! <x fj - 5 - 5 i - B j> - 0 • 





J - NEAR hypothesis 

Since L a 1 - 0, I j 

1 1 1 J ~ 0 We can use £ , . . ff 

rly M"Z theS : n0rnBl ^ ena this j ‘ " 

V "**- ' 0,1 Xi - ~ . B * x 

Clearly these estimat J * j “ X 
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i-hg 


* 2 B s o f 

l j 0 for 


gives 


f 

a 

ar 


— mares are uni 

" which ls the minimum value of V’ The error sum of squares 

f ° r 911 P° s ®ible values of'the 


parameters V, a. « s o . . 

l j s> is given by 


The degrees of freedom ls clearly tj . (I 

The sum of squares due to the hypothesis H + + tl ' (1 " l) <J ’ » • 
of H a is the minimum value 


11 

e 

1 of 
ny 
• 

na- 

the 


| i=l }h (X ij - H - 6j) 2 
with respect to the parameters 


y i 8j 1 s and is equal to 


M ''1 


* J p I J 

R ^ x ii “ x J = s 1 (x. . - X - X . + X j 2 + 

&1-1 j-1 -J i=l j=l 1J 1 * *J 

W I ■ 2 . 

E J(x. - x ) 2 . 

/- • ■'i 2 1 • • • 

Hence the sum of squares due to the deviation from the hypothesis 
H a ° r equivalently the sum of squares due to rows is given by 

I J(x. -x ) 2 , with degrees of freedom (IJ - J) - (I - D (J - « 
a I-- U Similarly with the sum of squares due to the deviation 
fr ° m the hypothesis H is « * <*. j “ " ith J - 1 degrees 

of freedom. Thus the likelihood ratio test of level a for testmg 


fs equivalent to rejecting it if 



(I. 


> F 1-0,I.-1* (I- 1} (J ” 1) (3,1 ° 6) 



i-1 j=l 


V / v —X —X - tX ) 

ij i. •' 
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4-est of ”3 

sad the laex^ocd ratio ’ " 


J v ) 2 
I I(x i rX .. 


^Twl) (JrU. j +x ) 2 

J-1 J £ -J •' 

i-1 J-1 verify directly that the Se 

yhe reader will «n* « ^“H^test statistics have the 
are liKelihcod ratio *£•£**„ null hypotheses. 
F-dxstribution under t . £ic ation without interaction 

The ANOVA for the two-way cl 
is given in Table 3.9. Note ^ 


(I " 1) (J - 1) • (3'.iq 7) 


j 1 1 2 a 2 = T"T L ® 

a? = -i,r 1 a i ' B J" 1 j = l 

A I- 1 1 


2 

j * 


,.9. ANOVA table for two-way classification 
Tat>1 without interaction----- 


f npnrees of Sum of squares 
Sources of Degrees u 

variation freedom 


Mean sum Expected 
of squares mean suit, of 
squares 


Rows 


Columns 


2 S I 

*i= ± i x J(x i.- X .. ) m 

2 J 2 S 2 
S 2 = E I (X .-x ) •=—T 

2 j-1 *1 •• J ' 1 


0 +Jo 


0 2 + 10 2 


Error 


•'9 1 J 

(l-l)(J-D s = £ £ (x. .-x. 

3 i-i j=i 13 l- 



-X .+x ) 2 

• 1 ■ • 


Total 


I J-1 


I J 

1 £ (x .-x ) 

i=l 1 = 1 1J 


and the dii 


Thus the 
al ternat 
given i n 

H 

4 a 9ain s 


If H d 13 re J ect ® d then it is necessary to divide the a.'s into 

groups such that belonging to the same group are equal and 

those belonging to different groups are different. This is ** 

by means of the t-test or by multi^i- ^ C aS e 

of ftnc ua „ Y xutl P 1 e range tests as in the 

ot one-way classification • +ed 

ion. The rejection of H„ could be treated 
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Table 3.10. Average yield per plant in grams 


Crosses 


Locations 

2 


101.32 

103.04 

91.78 

77.08 

76.36 

97.40 

91.96 

68.16 

85.62 

92.46 

99.66 

77.64 

91.62 

89.18 

105.32 

92.62 

73.74 

78.74 

72.18 

74.02 

72.27 

77.68 

82.96 

93.66 

108.72 

110.14 

109.28 

85.46 

69.64 

86.28 


90.22 

t 

V 

95.32 

j ifit 

96.16 

j 

! 

38.42 

l 

80.26 

j 

84.78 

j I 

82.74 

i “ 

90.48 

1 

104.36 

j The h 


68.96 


Table 3.11. ANOVA for Table 3.10 
Sources of variations Degrees of freedom Sum of squares F 
Between Crosses q 


Between Locations 

Error 

Total 

Since F _ ^ 


3284.6025 
110.4387 
1192.8469 
4587.8881 


4.9445 

0.4987 


lrce F C.9,27 ■ 3.15 for a = .01 we reject « c « t a = .01. 

cbnslder"th e twoI^ay^i^T^^ 1 ^ nn-s Here we will 

observations per cell L lcation with an equal number K of 

the (i»j)th cell i Gt X ijk den °te the kth observation in 

X ijk be independently a ■ *. • * J k - 1 , ...,K and let 

same variance o 2 a A 1S ributed normal random variable with t 
E (X ) _ and 

iJk) ' p iJ • 


1 ^ 
# ob t 
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It is a linear model in ... 


parameters y,.'s. Its design matrix. 


given by the coefficients of y s in (3 ^ U “ X ° n 

i Honoo all i • M ij in ^ 3 * 109 ) is a matrix of full 

rank. Hence all linear parametric • _ 

lc tUn ctions of y . . ' s are 

V estimable. ij 

} In terms of the qeneral , 

A ycnerai mean y, effects ot.,£. and 

interactions y^ we can write 1 J 

1 s< x ijk> * v + “i + B j + Tr ±J , 

1 = 1 . Z ‘ J = 1 . J - H = 1.K, 


■L “ -L J 

Jj ot. — 0* Z 6. = 0, Z v — n y 
1 1 1 J i=i Yi J ' jfi 


Y. . = 0 
ij 


The hypothesis that we usually wish to test 


(i) H 0 : a z = 


= a =0 

J v 9 

= 6 J = o , 


(ii) H g : B 1 = ... = 6j = o , 

(iii) H y : all y = 0 . 

The least squares estimates (which are also the maximum likelihood 

estimates) y, a 's, B.'s and Y 'sof y, a.'s, S.'s and y. ,'s 
uy i j. ij i j 'ij 

are obtained by minimizing 

| s2 = l j * (x ijk - V - - Bj - y.j) 2 

2 

with respect to these parameters. Differentiating s with respect 
to these parameters and equating these partial derivatives to zero 
we obtain a set of normal equations which, with 


£ 5. =0, £ B. = 0, £ Y • ,=0, £ 

i 1 i j i=i 1J j=i 

give us the unique estimates. 


L ? ir° 


P = x 


a . = x. - x 

l l.. 


, B. = X . - X 

J . J. 


Y ii = (X. , - X. - X + X ) . 

*J lj. 1.. .j. ••• 2 

error sum of squares which is the minimum value of s with 

r ®spect to these parameters is given by 
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with IJK-IJ 


I Z I (x R - x ij. J 

i j H freedom. The sum of squares due 

With ijk-IJ=W(K-D degrees ° minimum value of 

the hypothesis H £ is ^ven by 2 

III (x ijk " y ' 6 j ' 

i jk . r _ u $,'s and Y il' s ' ana 15 given by 

a. n the parameters M» p j U . y 

with respect to tne F 2 

L Z E (x . v -x. . ) 2 + JK E(x i.. “ X *** 

ill ' ijk U« i _ 

i j k f £re edom. Hence the sum of squares 

with IJ(K-l) + I- 1 hypotheses. Which is also called the 

due to the deviation from the^hyp^ ^ ^ by 

sum of squares due to the 


of squares due to the 


JK f (x i.. " x ...* 

With'l-l degrees of freedom. Similarly the sum of Squares due to 

the effect of B is given by 

. 2 

IK l (x - x ) 

. .J. ... 

with J-l degrees of- freedom. The sum of squares due to the 
hypothesis is the minimum value of 

■ ? i " ■ B j’ 2 ' 

with respect to these parameters u, a^'s, Bj's, and is equal to 

I T I (x. - x. , ) 2 + K l E(x. . - x. -x. +x ) l 

i j k !J k !J- i j iJ . i.. .J. 

with IJ(K-I) + (I — 1) (J-l) degrees of freedom. Hence the sum o 
squares due to the deviation from the hypothesis is given 

K l t (x - x - x . + x ) 2 

i j iJ. • i.. .j. ... » 

with (1-1) (J-l) degrees of freedom, it may be remarked that 

except for the parameters w>yi^ „ 

pu ers which are zero under H , H Q and H , 

rrr the ieast squaces -«■«- -aer 

the same as under n (the parametric space for this problem: 
The likelihood ratio test n* i , . +h 

equivalent to rejecting H whenever' “ *“ teStin9 
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I ? 

JK Z(x. -x ) z 

13J&JLl -— — - > F 

1-1 Z Z Z< v .2 1 - a .I-l,IJ (K-l) (3.110) 

i j k VX ijk- X ij. ) 

likelihood ratio test of level a for testing H_ is equivalent 
to rejecting whenever — 

J " - 

IK Z (x . - x )* 

IJ (K-l)_1 


— \ . - A 

^ »J • . . 

Z Z Z (x. -x. . ) 

nk ^ 


> p # 

- 1-a, J-1, ij (K-1) ' 


and that of is equivalent to rejecting whenever 


K Z Z (x. . -x. 
i 1 1J * 1 


K Z Z (x . -x -x . +x ) 
IJ(K-l) i j J * * * ‘ ** 

(I-1)(J-1) l Z l (x. . k - x. ) 2 

1 J k 1 J K ij • 


(3.111) 


l-a,( I-l)( J-l) ,IJ(K-1) , (3.112) 

The reader can verify these facts directly as in the case of 
one-way classification without any difficulties. 


2 1 _ 2 _1 y o2 2 _1_ y Tv 2 

°A = T-T Za i' 3 " J-l ? 3 j' A3 ( I— 1) (J-l) \ 

i J 1 J 


If the hypothesis H a or is rejected one can use the t-test or 
the multiple range test to find out which pair of^ the a^s or 
which pair of the 3 .'s are significantly different. If is 
rejected then one would not ordinarily explore it furthe , 
although theoretically t and multiple range tests can be applied 
to find out the interactions which are significantly different 

from zero. 

in the same way as in the case of two-way classification 

without interaction it is easy to verify that the test given in 

(3.110) is uniformly most powerful invariant for testing H 

I 2/^2 > o N ote that here r=I-l, 
against the alternatives JK 0.^/0 

s = IJ, N-s=(K-l)IJ. 
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if 158 analys.s fort 

, 12 ANOVA ta tion 

Table 3.1 2 - .,^h inter _ 


two-way classification 


with 


Decrees 

Sources o£ freed° m 
variation • _ 


Effects of 
A 

Effects of 
B 

interaction (1-1) (J-U 


of Sum 


0 f squares 


-x ) 


.2 = jK f(x i / x ... 




s 2 =E ? S(*1J“ 

1 J . x -x ) 

• J • • • • 


Error 


Total 


I j (K-1) s 2 = £ (x ijk" X ij 


Mean sum E X p ec . 
of squares sum 

___£auar e . 


o -K3K 0 2 
A 


jTT u 2 +3Ko^ 


2 ( ~iTT~i)° 2 ^ a L 

s 4 2 

IJ (K-1) ° 


IJK-1 


1E l (x ijk- x ...’ 
l 1 X _ 


Similar results hold for the tests given in (3.111) and 
(3.112) . 

3.8. THREE-WAY CLASSIFICATION WITH INTERACTIONS 

Suppose there are three factors A,B,C affecting the observations 

the factor A has I levels, the factor B has J levels and the 
factor C has K levels We <?h = i i , 

lumber of M observations for each tH ^ 

“hare i denotes the level of the f * comb ^ation (i.j,k) 

of the factor B and k denotes to. T " *' J 'k' 10 * 65 the level 
x ijkm he note the mth observati SVel ° f the factor c - Let 
‘ * I-...!, I ! 9 n T ° n ^ treat -ht combination 

ljkm - e independently distriv ~ 1 . . m = 1 .. and 

with mean j „ lbute ^ norma] 

ijk and the same al rdnd om variable 

° r each f ixed levei y *° * 

l i^i.k-, jk+> . k ° f ^ VectQr c 

• • 1 


“ th * interaction of 


m 


the ‘hh level 


° £ " With the ith level 












ables 
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If we take tha average of this 

get Y for 311 levels of c we 

a AB = (y - y y 
lj 1J * • 1 * • • J • ...' 

f Xt ±S called the interaction effect of 

with the jth level of b. Similarly the ! ^ ° f A 

ith level of A with the kth level of r 10 SraCtlon a ik of the 

o* tna j* i.vi c* b With the eve r: f t interacti ° n 

ar ex of c are given by 

$ V (M i.k^i..^..k^...^ 

“jk = (y .jk" y ..k " p .j. + » mmm ) . 

A 

The effect a of the ith level of A is obtained by 
averaging for all levels of B and c and is given by 

= (yj -u ) . 

1 1 • • • • • 

Similarly we define 

“j = 4 ‘ «K.. k -M...). 

Tha quantity „ is defined as the general rean . As there are 
three factors it is also necessary to have a neasure of the 
combined effect of them. Let 
ABC 

“ijk = (p ijk _p i.k _y .jk +y ..k ) 

Obviously it gives us a measure of how the interaction of the itl 

level of A with the jth level of B when the factor C is ‘ at the * 

fixed level k differs from its average value when it is averaged 

levels of c * There is no dependence on C if and only if 
QflOL .. ABC 

ijk = Tll e quantity ot ± is called the three factor 
lr >teraction between the ith level of A, the jth level of B and 
t,1e kth level of C. 

With these quantities we could write 
c _ _ _ _ 


E(X ijkm ) = H + ct A + a B + a? + a AE 


C AB BC , AC , ABi 
: + a + a.. + a.,. + a, ... 
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Z a A = 0, 


j 


v AC a 

I a ik = 0, 


, ABC _ n 

J a ijk " °' 


, B = 0, 

J 

i 55 0 * 


k 


AB n 

lj 

■ "• 

BC n 

k “jk ' ° ' 

a AC = 0 
a ik 

„ ABC _ 

# S a ijk - 
i 

0 , 

ABC 

a ijk 

0 . 

-* 


As in the two-way classification, the hypotheses of main interest 
are all effects of each factor are zero, all two-factor 

and the three-factor interaction is zero. 


interactions are zero 
For example, for effect of A 


H, 


a A = 
1 


= a A = 0 
Ii 


Similarly, for the interaction of A,B .and for the interaction oi 
A, B,C 


H- n : all a A ® = o 
AB lj 


H : all a ABC = 0 
ABC ijk 


The sum of squares due to the deviation from each of these 
hypotheses and the error sum of squares can be obtained in the 
same way as in the case of two-way classification. 

sexisTl‘ , " r “ eStlnBteS <WhiCh are the — - the 
imum livelihood estimates) of these parameters are given by 

~ab 


Vi = X 
T ABC 


“i = X i “X 


a 


ij 


U ij.' X ’ 


a iJit = -X 


) , 


• • 


’ijk.- iJ..' X i.j.- x . Jk +x +x 
and similar expressions for other ° 

sum of squares due to the d effects 

and the error sum of squares ' t i° ns ^ orm various hypotheses 

es are given in Table 3. 



i • • . j 

+x -X ) 

•a eeJV. eeee 

and interactions. 


13 . 
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UNtflK HTrOTHES I S 

3 . 13 . ANOVA table for the th»- 

interaction ree-way classification with 


sources of Degrees of 
variation freedom Sun of squares 


Effects of 
A 


1-1 


Effects of 
B 


J-l 


Effects of 
C 


K -1 


PC 



Mean sum of Expected 
squares mean sun of 
squares 


Error 


si? = JKM I ( ofy 2 
i 1 

•t 

i-l 

2 2 
C4JKMO, 

A 

s\ = IKM l ( a?) 2 
j j 

4 

J-l 

2 2 
o +HCMCC 

D 

s i = IJM Z ( 6?) 2 

3 to k 

2 

S 3 

K-l 

2 2 
a+I JMo^ 

s 4 2 =KMH(aV 
i j 1J 

2 

s 4 


(I-lj (J-D 

o + Jsjyia__ 
AB 

s 2 = JMH( a??) 2 

5 i k lk 

4 

2 2 
rr -uTmtt 

(I-l) (K-D 

° 

2 , -BCx 2 

4 - » HI a jk ) 

j k 

s 6 

2 _ 2 
° +IWa B C 

(J-D ((K-l) 


s 2 


s 2 = M 2 2 

7 i j k 1JK 

7 

2 2 
— O'fMC 

(I-D (J-D (K-D 

^ABC 

s 2 -£ £ Z £ ^ijknT^ijk.' 
i j k m 

2 s 2 

IJK(M-D 

2 

0 


Total 


IJKM-1 


Z l l % . 


i j k m 



, 62 analysis of variance 


Where 


2_L E (a A ) 2 f 

— T 1 T 


I - 1 i 


,2 _ 


s i 2 ' 


- l ^ ''"ii 

iirinjn) i -i lJ 



°ABC ' "( 1 - 1 ) (J" 15 (K " 1) 1 K 


„ / ABC* 2 

E £ £ (a ijk ) 


and sinvilar notations for a . V ^ ^ hy p 0t hesis, say H^ fi 
The likelihood ratio test of 
is equivalent to rejecting H AB i 
2 

(J-U-IEJKlM-l) , 

(I- 1 J s it as an exercise 

: -i,, f nr other hypotheses. We 

and similarly for 1 t DO werful invariant 

to the reader to verify the uniformly most power 

property of these tests. 

The case of higher-way classification is too complicated 
to reproduce here. 

EXERCISE 3 

1 . For the linear model ( 1 . 3 . 9 ) with rank X. = r * p , show that 
at most r linearly independent linear parametric functions are 

estimable i 

2 . Let ...,X 6 be independent random variables having a 
common variance d 2 and expectations given by 


E(X^) = + Bg # E(X 2 ) = & 2 + e 5 

E(X 3 ) = 83 + S 6 . E(X 4 ) = 8 4 + 8 6 

e(x 5 > = + 83, e(x 6 ) = 6 2 + 8 4 

Show that 6 X - 6 2 is estimable and find at least three unbiased 
estimates of (83-63) . Find also its best linear unbiased 
estimator and find its variance. Establish further that B.+B, 15 
not estimable, obtain an unbiased estimate of d 2 . 



Consider the linear 
3* _ n i o • 


UN| VARU Te 


”° de 3 given 


GEner * l linear 


HVPothes 

the 


( s uch that I' 6 is estiroable find _ 

sqUarS eStl,Tat0r to i-i is ° f Vari — of i 

„ r%j. For the line ’ r modei (3.9) show that 


ls 163 


P~vector 

its 


is the best linear unbiased esu^r 


>■ :V Ul »» — “"«M estimator of kit .« , 

' COV<k'i^'i»= 0 -13 N-vectors b satis £ying ^ ( f,^ if 

»• FOr r e wilre ear t ” del (3 ‘ 9) d6fine ^ P functio: 


ons 


j 1 . . 


J J 

n being the jth column vector of x. 

-J 

(i) Show that if the linear functi 

l 4 - 0 rK 


on 


Ml + 


+ 

P P 


J 1 * f 

has zero expectation then it is identically equal to zero. 

(ii) Show that every non-trivial linear function . ■ + *' p <l>p 

is the best linear unbiased estimator of its expectation. 

6. Let X , ...fXj,, N > p + 1, be independent random variables 
with 


E(Xj = <* + S i< x li - X l ) + ••• + S p 'VV 

ffM&g =0,1- 1, 

. are known constants, 

M WW rJ N 

l> -x, - i 


N i=l i 1 (3 _ 9) aI1 d assuming that 

Express this as a linear model as giv^^ estimators of 9. 

■ X is of full rank obtain the es Assuming that X has 

‘ ..B and their variances and co a ^ f the 

. I'd . ,1-i.rihution, fina 3 


an 


...,6 and their variances ana . tests of the 

P , distribution, fina 

N-variate normal 


following: 


(i > H 0 = “ = “0 ' 


= B°.V B 


0 


„ Q =: B .»•••' '"fl P 

(ii) H 0 ! 

... = e , 


(iii) 


H, 


w v u • 3, ' r P 

. i-j-i) (i v ) H 0 * 1 

• “• • s< 

are spe c 


‘0 * w i ' J 
where "0 -0 


^ B°. B 


‘. *0 

vw* Consider the 

Br 


, ,3 9) «ith 
linear model ( 3 - 9 ’ 


J8 , Cov(Y) = 0 V 

gW = X/l 





' 6 , tive definite matrix of dimension N * N- 

where V is a known P°“ unbiase d estimator of an est inable 
Prove that the best n by L'B where 1 rs any solut ion 

parametric function £ £ 1 

of 

X'V- 1 ^- X 'V *2 • problem 4 also holds for the U„ ear 

8 . . prove that the result P 

rodel assumed in pro lt nentioned in prbhlem 8 shov 

8 . As'an application of the res 

that; if 

E(X ) - V ♦ 0 .1 <N-i + »- ■ r = . . 

r i=l 

o r , . . - \ - 2 i<r<s<N, 

„ , v y \ = a 2 l N-1+1) * 1 - - - 

Cov(X r ,X s ) i=1 - 

. . acp s estimators of U and a are given by, 
then the best linear unbiased estima 

„ NX--X a N X-X 1 

y= -Nrr and ’ 0= "NTT ' 

where X = £ * X i 

10. Let X = (x x .Xj,) ' be such that 

E (X) = U , Cov(X))= S. 

Show that 

e(x»ax) = y'Ay f tr(Al) . 

11. For the linear model (3.9) assume that X has a multivariate 
normal distribution and X is of rank p. Prove that 

l_a=p{|i' (B-P) |< (^ )F l-ot / ,p # N-p)^ (a) (£' (X'-X)” 1 *)* 5 for all £ f 

12. In (3.9 3) show that under H . : 0.. = ... = 0 , U is 

u 1 p 

distributed as a chisquare with p-1 degrees of freedom. 

13. For the linear model (fixed effect) 

e(x.) = e 1 x li + e 2 x 2i f i = l,..., n, 

Cov(X ± ) = o 2 , 

X l'..*'Xfl are independently normally distributed the 
following data were obtained- 




13.0 


13.0 


29.2 

4 


HYp 0THES,i U j 


33.1 


32.0 


IjL 

_<y 


46.2 

8 


sm 8 


s bov 


-K, Find the least squares estimate of ^ 15 

■ \ Test the null hypothesis h . ese P a rameters- 

U 1 ' o * Pj_ = o. 

f l«. For the tW °- Way c ‘*' ass lfication v lthout 

fo i lowing; erection find the 

( .j the 9 5% confidence interval of a _ a 

Mi) the 95% confidence interval of R 1 \ 1 ' ) 

11 P j p i ■ ( 1 * j ') . 


ni ) the v.uiiiwwtt.'B interval of r ~ • 

11 P j i ( 1 * j ') . 

.c Table 3.14 gives the birth weight (in * 

I 5 - , ^ 9 C Un lbs ) of six litters of 

the litters being of different <?i 7 0 c * •. 

pi9 s ' erent sizes. Analyse the following 

2 - 4 -a to sxciinin© if iitto^r siz© is 3.11 agqi rmav>i _ 

daf a assignable cause of variation 

Jiven by, V m birth weight. 


Table 3.14. Birth weight in lbs of pi gs by litter 


sizes 


litter size 

weight 


iriate 


t o -> 



2.3 

1.7 

1.9 


__ £ m I _______ 

16. Suppose we want to determine the individual " eiqh ^_ ^ 

1 by weighing them in a balance. One o£ these 

«ch one N times, and estimate its weig^^ ^ each . obse rvation, 

I “observations. If ° 2 is the error v ^ second method is 

| then the precision of each estimate is a . * ^ gome of these 

! t0w eigh the objects in the following wa ^' and place weight 

3 in one pan and the rest m the ° according as the J 1 

.. . . _ n 1 or - 1 i 


3 in one pan and the rest m t ^ ^ according as the 
i^'hieve equilibrium, bet x^ = °' or placed on the lef1 

is not used, placed on the right P 
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' 6 

f*n —iri,e itt «*+**' 

e quiH brlUm w = x. » biect. Suppose this 

a) + ••• + iP p 0 £ the ° N) using diff 

where ** « ** “ repe ahed ■ ^ pans. Thus we get a 

weighing P««« <i obJec ts for dl«« u ) ■, from which oj 

combinations o qUa tions i n - " 1 ' T £ e design matrix X 

system of - “-“tans of feast squares. ». ^ x or shb „ 

can be e»ti«“ elements are individual 

is the N * P »“ lX * estimates of these Qr ^ 

that maximum precision of nts of X are eith 

^ „ an be obtained if tne 
weights orthogonal. 

and the columns ^ inaividual weights are to 

«• objeCtS in a common balance according to the 

na determined, ere t lndlcates that the object was 

following design. 7 indicates that the object was 

placed on the right pan and -1 

placed on the left pan. _____— 

o, o, 0 3 o A weight 

x 1 1 1 20.2 

1 -1 1 -1 8.1 

11-1-1 9.7 

1 -1 ^1 1 1.9 

1-1 1-1 8.2 

1 1-1-1 10.4 

1 - 1-1 1 2.0 

-i_1_1_1 19.9 


estimates of the weights 


ssuming that the errors are independently normally distributed 
ith zero mean and common variance o^, 

i> Find the maximum likelihood estimates of the f 

hese four objects; e wei( 3 hts of 

ii) Find the estimate of a 2 . 

3. The data in Table 3.15 rel si-• 

Ut) of daughters of 5 sirp _ lng t0 lacta tion yield (10 lb 
‘"krej hera maintained at theT £r ° m the ra =“<as of 


at the I hStitut( 






er ent 


Table 3.15. 


Daughter 


Lact 


5 ti Qn 




yi ^a Uo 


near 


my '“°thes, s , 


Un its) of 


Show 


ire to 
the 


Analyze the data to see if lactation yields for different sires 
are different. 

|l 9. For the maximization of phosphate potential in soils of 
jfcdras state, India, a scheme was initiated by the Indian Council 
’of Agricultural research in Madras during 1961-196 2. The details 
°f one of the experiments conducted under the scheme are given 

below. 

ta) Treatments applied 

1. Control 

2. Superphosphate 30 lbs per acre 

3. Superphosphate 50 lbs per acre 

^ • Rock-phosphate 30 lbs per 

, Rock-phosphate 60 lbs per ^ farm yard manure 

Superphosphate 30 ibs ar£ ^ 

per acre 5 tons of t*™ yard nBDUte 

• Superphosphate 60 lbs a 


per acre 


Per acre 










infffji'aBi 


g* : ’ 

BP “ 


6 SAN »LVSIS0F • s tons ~- 

30 1°* 

„ R oclcpl»o s P ha farm yard per acre 

8- j t tons OJ- 

acre _ 0 j_bs and 

Z~ 5 tons ^ aer i 

10 • Farm (c) crop sown Rag,. 

» Mot 5126 M ‘ ’ yleld of Rag^i^ffll^-- 

_----- 


VAR‘ ANCJE 


0 f farm yard manure pe t 


rocke 

Farm 


plot size 20' X 


Table 


Repli ca ‘ 
bio ns_ 


5.62 5.00 
4.50 5.50 


4 

5 

6 

4.00 

5.50 

3.50 

4.25 

4.75 

5.50 

5.00 

4.50 

4.50 

4 .00 

4.50 

3.70 

a two- 

-way classic 


5.00 5.25 

4.75 3.75 


5.00 

5.37 

5.75 

5.50 


possible conclusions. 

20. Let (x-a) 'M(x-a) < 1, with M a symmetric p x p positive 

definite matrix, be a ellipsoid with centre a_. Show that for any 
h= (h , ...,h ) ' * 0 , the point x = .(y,.. ) ' lies between two 

* P ^ P 

planes of support of the ellipsoid orthogonal to h if and only if 

lh'(x-a)|< (h'M - 1 h)^. 

21 . - Soufcg : Statistical analysis in Psychology and Education by 
George A. Ferguson, McGraw Hill series in Education p.253. 

^ . Given in Table 3.17 were obtained from an experiment 

designed to study the effect of two vari *0 

performance of rata ln . mze ^ w ”* leS ° n reaSureS ° f 

bright, mixed and dull a . th three strains of rats viz 

; - auii. a grouo of o . 

reared under, free or re^t^ *. ats from each strain was 

restricted environmental conditions. 

Environ- ' ~—■- - Ta b le 3 17 

merit " --* 


fright 

26, 14 

4 1 , 16 
28 * 29 


-- 

■—^®^__Dull m ent -- 

- 4i » B 2 ~ 3 T~~irr— 

*• 86 3»; 99 W tri °- 51 ' 35 


45 


59 »126 


il27,| 


._ Strain 

.Bright Mixed Dj; 
‘ 5l »- 35 39 , 114 42 , 133 

36 104 , 92 92,124 

97 ' 28 137 , 87 156 , 68 

122 . 64 144,142 
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l, Analy ze t * ie data as a two-way layout under normal theory 
model. 

2> do the data show a significant interaction between two 
experimental variables? 

2 Use T- and S-Methods to decide which pair of strains differ 

significantly. 

22 source: Engineering statistics by A.H. Bowker and S .Lieberman. 

To study the effect of plate temperatures and filament 

lighting on transconductance of certain type of tube an experiment 

wa s performed with two different plate temperatures 550°F, 600°F 

and four different filament lighting current and L 4 . 

Three replications of each combination of plate temperature and 

filament lighting current were used. The data recorded on 

transconductance measurements are given in Table 3.18. 

Table 3.18 


Plate 

temperature 


Filament lighting current 


550 °F 


600°F 


3774 

4710 

4176 

4540 

4 369 

4180 

4140 

45 30 

4 374 

4514 

4 398 

3964 

4 216 

38 28 

4122 

4484 

4524 

4170 

4 280 

4332 

4136 

4180 

4 226 

4 390 


Make a complete analysis of the data under normal theory 
assumption. 

. 1 anaivQi s bv C.H. Goulden, 

23. Source : Methods of statistics 

John Wiley, p.169. 

. volumes in milliters of loves 

the data in Table 3.19 give froro 1Q0 ^batches of 

°£ bread made under controlled con ^ wheat flour and 

fough made with 12 different varie 1 iuin bromate. 

Attaining x = 0,1, 2. 3.4 -Hi— ^ ^ ^ ^ 

^lyse the data as a two-way layout un 
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Variety of 
Wheat flour 


Table 3.19 


Volume 

1 

10 75 
980 


for x 


1055 


1040 


1065 


1000 


1000 

1015 


Duncan, 


24. Source : Basic statistics for health science students by 
David S. Phillips, 1978, W.H. Freeman and Company, San Francisco. 

Table 3.20 gives measurements on pulse rates taken from an 

experiment conducted to see if an exercise programme for CV 

patient had any effect on pulse rates. The pulse rates of 10 

patients were measured before entering the programme and after 

being in the programme for 1,4 and 12 weeks. Analyze the data 

under normal theory model to see if the programme affected the 
pulse rate. 

cUssIrV" 6 data PreSented in Table 3.19 as a one-way 
correSDond' 10n Wlth ” ° bservations for each of the 4 population 
then ““lb 0 h ValUes ° f *• Test under normal 

techniques of Section^T* 5 ^ eqUality o£ variances by using the 
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Table 3.20 


Patient 

Start 

1 week 

4 weeks 

12 weeks 

1 

90 

87 

88 

90 

2 

80 

80 

78 

74 

3 

70 

75 

73 

75 

4 

76 

82 

75 

72 

5 

82 

80 

74 

70 

6 

85 

83 

80 

76 

7 

100 

95 

85 

8 2 

8 

80 

82 

75 

75 

9 

90 

95 

82 

80 

10 

75 

75 

70 

72 
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CHAPTER 4. ANALYSIS OF COVARlANrr 

VARIANCE, OPTIMALITY of designs 


in general a scientific experiment 

is unknown about a group of tr / erf ° rnied to know something 

hypotheses about the correspond' ^ and to test certain 

yP responding treatment effects The 

treatments are applied on P v„ ar ^ , * ine 

P n ex Penmental units selected for 

the experiment. Different scienHf^ 

- . scientific experiments have atleast 

one feature in common, that is . , 

. . ls ' the variability of the 

experimental units, when variabilitv o-f 

• * =n ■ ariaointy of the experimental units 

is small relative to treatmpnt ^-s -c* 

. „ . . . tmen t differences and a complicated 

experi a design is not required, for the experiment, the 

experimenter simply takes a large nunfcer of observation;, if 

feasible, on each treatment effort 

dcmenr effect and computes their mean. 

The variation around the mean 

a nne mean can be made as small as desired 

by taking more observations. 

When there is considerable variation from observation to 
observation on the same treatment and it is not possible to take 
an unlimited number of observations, the techniques commonly used 
for reducing the variation are 

1. Use of a proper experimental design, 

2. Use of concomitant variables. 

We shall discuss here these two techniques in the general 
framework. First we shall discuss the use of concomitant 
variables in linear models and their analysis. The analysis of 
linear models with concomitant variables is called the analysis of 
covariance. Then we shall discuss different criteria for selecting 
the optimum design in a given situation. Specific experimental 
designs along with proper allocation of experimental units and 
ra ndom allocation of treatments in blocks, their advantages and 
disadvantages and their analysis of variance and the analysis of 
50 variance for some'of them will be treated in Chapters 5 and 6. 

If the above two techniques fail to control the experimental 
'^lability then the number of replications of different 
Aments (in. other words the number of experimental units) will 
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neea to be increased to a point where ade q uate control of 
variability is attained. 

4.1. ANALYSIS OF COVARIANCE 

«, for given experimental, material, the use of proper 

experimental design cannot control the experimental variation, 

the use of concomitant variables which are realted to the 
experimental variables may be effective in reducing the 
variability. Consider the one-way classification with 

E < x ij> “ 6 i' 

Var (X i .) = o 2 . j=l.....» 1 , i=l,....P- 

If the ordinary analysis of variance for testing the hypothesis of 
equality of treatment effects shows a highly significant 
difference in treatment effects due to some factors affecting the 
experiment, taking the following model (which takes into account 
^fche effects of such factors) 


E(x ij> = 6 i + yt ij 

Var (X^,) = a 2 , j = 1,...,^, i = 1, ...,P, 

where y is the regression coefficient on the concomitant variables 
t , (which are related to X^) , the variability of'treatment 
effects can be considerably reduced. In agricultural experiments 
with plots of land as experimental units the t^ may be a measure 
of the fertility characteristic of the jth plot receiving the ith 
treatment. For experiments with animals to compare the growth 
rates of groups of animals receiving different diets, the observed 
differences in growth rates can be attributed to diet only if 
animals are similar in some observable characters like weight, a<3 e 
etc. which influence the growth rates. In the absence of 
similarity one can use t^ as the weight or the age of jth animal 
receiving the ith treatment. 

, If i ” (4 : 1J ° ne WiShed to co »si<3er the quadratic regression 
in t.. instead of the linear one, that is, 

® (X.t ^) = 3, + y.t. . + v^t^ 
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the analysis of covariance 

, analysis of covariance „ ith t ^ <4 ‘ 2 > iS the ““ “ «» 

tMO c °ncomitant variables t.., t 2 . 

In the case of the tun ^ 

... y classification of Section 3 with 

% one observation per cell i • ecrion j, with 

*f hl „ „ the llnear ">°ael with concomitant 

variables is given by 

e(x. J = y + + & + yt 

u 1 j ii 


E (X. .) = y + a + e + Y t 


1 ij + Y 2 w ii' 


(4.3) 


where 


1 , j — 1,. 


(x ij' t ij ) ' (x ij' t ij # “ij) are observations in the (i,j)th 
cell and ^ j » “ ± j ar « concomitant variables. 

Though we shall consider the observations on concomitant 
variables as constants selected by the experimenter, in some 
experiments it may be more realistic to think these observations 
as values taken by some random variables. In such cases one has 
to treat E(X^j) as the mean of the conditional distribution of 
the X given the observed values of the concomitant variables. 
Obviously all distributions derived in Chapter 3 for statistical 
inferences with respect to this conditional linear model is also 
conditional as are the level of significance, the power and • 

confidence coefficients . Since such an analysis requires 
discretion as to the joint distribution of the concomitant 
variables and has little applied interest, we shall not attempt to 
deal with it in this book. We shall now consider the analysis of 
covariance of one way classification, two-way classification with 
°ne observation per cell and the linear regression model. The 
analysis of covariance for some specific designs will be treated 
Chapter 5. The analysis of covariance of higher-way 
classifications and the two-way classification with more than one 
•observation per cell and other designs are left as exercises for 

deader. 


and l a. = 0 


J 

l 3 . = 0 , 

J=1 J 
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„ . . ration ! we have already discussed th, 

- ne V SX -rTToneTway classification with p normal 

analysis of variance o ^ ^ discuss here the analysis 

populations in ap ion with one concomitant vari abl 

invariance of such a classic ‘■'■‘■e, 

.p be a random sample of sia 

Let X ir J = 1.“V 1 . ... \ 

from the ith normal population wi 


p = E(X ) = 6, * Tty 

, . 2 (4.4) 

Var(X i j) = 0 

where the e.,Y , o 2 are unknown parameters and the t.. are know, 
constants (observations on a concomitant variable) and let 

H : B, = ... = 6 - & (say) . (4,5) 

o 1 p 


'i. = 5T * x ij' x .. = H l ] X ij' 

c.= — I x. t. = -■ It.., 

\j Pi lj 1. N ± . lj 

t = - E t. ., t = - Z l t. . 

. i n -it .. n 11 


i i iJ 


N = Z N i 


(4.6) 


We shall use these notations without any specific mention 
wherever necessary in this chapter. In the notation of 
Section 3.1, the parametric space has dimension s = p + 1 an ^ 
the parametric space ^ under H has dimension s - r = 2. Under 


one must minimize 


m (x ij- e i- Yt ij )2 - 


(4.7) 


which for fixed Y leads to 8 = x v* „ ^ sUin 

. x . - yt . and reduces the sum 
squares to x i 


i j U ij " X i. ' Y(t ij - tf.)) 2 . 
.Minimizing (4.6) with respect to Y we get 


(4.8) 




9 " 
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A 

Y 


Z 

i 


Z 

i 
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0. = 3C -ft u. . = g 
i i- i./ ij + 


Yt 

1J ' 


Since 


(4 .9 


X ij ' W lj = *ij - h - Yty 


= - t._) , 


we get 


z z £ I*,, - x. i 2 

1 • 


i j 


i j 


ij 


i j 1 * 


Under tt one minimizes 


(4.1< 


0) 


i z(x. r e - yt..) 


with respect to 3 and y, which yields, 

A ^ 

3 = x.. - Yt. ., 


Y = 


£ £(x..-x ) (t .-t ) 

i i iJ •• 1J •• 


£ £ (t.,-t ) 

i j U • • 


A 

A 


P. . = a+ yt. , . 

ij ij 


(4 


Hence 


[I Elx.^-x ) (Vi-tjl 


i I(x. ,-i. .) 2 = I £(x. .-x ) 

IJ IJ . , IJ - ■ 


1 j 


1 J 



(4 


and 


E .) 2 = 
i j ij 




IJ 




£ Zlx^-x 

i J 
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7 , is given by 

The statistic w m ( • 

- * 2 

. J <4, 

U .S=E?± ±J- ~ ~~2 ' 

P-1 I 

invariant test of H q rejects h 

and the uniformly most P° ^ ^ centra l p distribution with 
whenever m > c. Under H Q , 

N-p-1 degrees of freedom. Hence c - F i-a, p-1, N-p-1 
From (4.12), 

* J«*«- 5 u ,a 

£ \\ 2 

-Mtey -X.. -Tf^J-t..)) 

a 2 

= Z I(x., - x, - Y(t.. - t )) 

. ij i. 1J 1 * 

i J 

+ * 5 (x i. - x .. + ^ (t ij - ‘i. 1 ' ' t -- >>2 

-??«*« - J ij> 2 - *u )2 - 


Hence, for computational convenience, we can write 


T xx- " ( E 

N-p-1 XX i tt. 


xx " E, 


J xx" E. 


where 


= E Z (x. - x ) 2 . 


= v E '*■ 

j 


K 

j 






Table 4.1. Analy S i s 0 f 

- Class ^icatiS® Vari ^nce fot 0ne _, 


an HYsis 

0F Co ***iAaoE 



so ur ceS Sum of P r °auct s 

R ation d.f xx xt tt 


Ousted sum 
cl.f 




/ popul ations P_1 p 


of squares 

^LSl^uares 



Total 


If the hypothesis H i s reiect^ 
m ° J d ' one can determine by 

* multiple comparison methods which of the contrasts in the 6. 

are responsible for this. For any estimable linear parametric 

contrast 


p 1 P 

= Z c. 3. , with E c. = 0, 
i=l 11 i=l 1 

its estimate, using (4.9), i 


= Z c. 3 
i-i 1 


is given by 
- P 


p ^ P 

= Z c.x. -y Z c t 

1 i=l 1 1 * i=l 1 ' 


(4.16) 


Since 


ar(y) = a 2 ( Z z (t.^ . -t i^ * 

i j 

rom ( 4 . 16 ) we get 


2 (I ) 

C . 1 X • V 

a n | 1 m hi —*" *? » • 

r W = o (I S~ + s 

i j 


(4.17) 


' . (with^ne^ser^^ 

• 2 - 

e discussed the ana y cell wi the 

..Ifictlo. «« 1,! » “““ 

>,i„, , ,„d J 1.V1. 









■ ficati on « ith ° ne 

.. this classi ..J be 

analysis of ~ X .y *^ubl- « ith 

- depend 

normally di 3 + Yt.j ' 

„ ..<**> =: + “i + j J (4 -«> 

varlX^) - ° 2 ' „ u the effect of the 1th level of , 

where h is the ^nd -an. of the jth level of B 

satisfying 2 a i = 0, ‘ 3 J 13 . observations (known) 


j ^ ths 

satisfying .t “i” j . t - are Observations (known) 

i=l ° E 3 = o and the ^ 

satisfying j= l J 


.a., pt variable. Let 
the concomitant v - 


= a = 0 / 

H ft : a i = •*• I 

n _ = 8 = ^ • 

H S : i ” J 

In the notation of Section 3.1, the parametric space ^ has 

dimension s - I ♦ J and the parametric space * under the 

,. ' _ r- - .T+1 and under the hypothesis 

hypothesis H a has dimension s-r - J+l ana un e 

H Q has dimension I + 1. The minimum value of 

P 

under is obtained by minimizing 

l 2 (x. ._p_ a ._$ ,-yt. .) 2 , 
i j ' U M u i p j 1 ij' ' 

tfhich for fixed Y leads to 

' * • Yt ..' “i = x i. - x - t(t -1 ), 

*,=x -X -y(t ,-t ), " • ’• 

J -J .j (4.1 

ihd reduces the sum of squares to 


squares to 


(4 ’ 20) With respect to y we Qe . 


p = x.. - Yt.. 
a, = (x. - x ) 


>4 

k 


AN/\ Ly 


S IS 


,v: v >. 


18 ) 


-r 

/A 


i i • 

'X. ^ - 


8. = (x - x ) 

j A • J A ' 


Of 


Pi.= u+ a. + 3. 

S Hence 

A 

2 


~ Y (t, _ x 
~ Y(t _ ’* 

A • X. 

+ Yt. . 

* 


OF 


CO Var 


1 ance 


t8l 




2 E (X . - u. .) 

ij 1J 




■ 


=2 2(x. .-X. _ 

i j 1J ' 


i.-X .+x _)2. ) y 

-- 


= E 


xx 


m 


,2 

xi 

J tt 


E ' + - 
xt 

E, ' 



where 



E = L 

XX 

1 

2 (x. . 

j 1J 


E . = Z 
xt 

l 

£ (x. . 

j 1J 

§ 

E = £ 

tt 

l 

£ (t. . 

j 1J 

Wy 

Case a. 

Test 


c. -x ,4x ) (t. ,-t. -t ,+t ), 

l. .j . . ij i- -J •• 


(4.21) 


a 


a' 


have to minimize 


I 


l f(x ij -y-6 j -Yt. j ) 


a and Y/ which gives 
with respect to V, Pj 

?(t -t K 

- Y j .. 

• • 


\k. 

* 



- x 


( 4 . 22 ) 





Mm 
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Hence from (4.21) 

Z Z (x. . - H: 1 ) 2 
ij lj 1J 


’ o if J<* 

= z Z(x ij- x .j ) 2 " z z(t 

i j 1 


z< x ir x .j > 3 


CE xt + JW_ 

= E xx + A xx ' —* A tt ' 
where 

A xx - * J < x i.- x .. )2 ' A tt = l 

V f J(x i.- 3, - > • 

Hence, from (3.7), the test statistic w is given by 

i— ~ i 


(4 • 23) 


j" E J(x ij-^j )2 - £ I(x ij- S ij )2 

IJ-I-J i i J J i j J J 


“ " (i-i) 


z Z (x. 

i j « ^ 


and the uniformly most powerful invariant test of H rejects H 
whenever 0 

m > c, 

~ ' (4.24) 

where the constant c depends on the size a of the test. Since 

under H a , W has central P-distribution with (I-1, IJ-l-j) degr e € 
of freedom we get c = F 

1-a, 1-1,1 

£S ^- Testof V To find the test of H e we have to minimizi 
r ilx.j - w . Ytij) 2 

with respect to a , and y< which 





v , = y +s(. + Yt 
i j 1 i j * 

Hence, from (4.21), 

12<x ij ' V 2 

"if 1 1 

f = z Zfx^-x:.) 2 . (t ir t. 
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(4.23) 


icts H 

a 

(4.24) 

ince 

degrees 

i, n imize 


.25) f 


i j 1 J i 


CE .+ B ] 2 

= E + B_*t J 

XX XX B 

tt 

where 


(4.26) 


I B xx - J KX.j-x..) 2 , B tt = lilt -t..) 2 , 

1 J 

| ®xt “ z I(x -j- x --> tt.j-t..). (4 . 

Thus the uniformly most powerful invariant test of H rejects 


(4.27) 


Ho whenever 


Ij-I-j 





E 2 (x. .-y. .) 
i j U U 


where the constant c depends on the size a of the test. Since 
under H 0 , W has a central F-distribution with (J-l,IJ-I-J) 

p 

degrees of freedom we get c=F ^_a,j-l,IJ-I-J* 

, tj -i <= r-o-ipcted, one can determine by 

If the hypothesis H a is rejected, 

. *i the contrasts in the a. are 

multiple comparison methods whi 

. The same is true for the 

responsible for this rejection. 

hypothesis H fl . y \ , 

e • - Model: Let X = (X 1# ...» 

4.1.3. r^n^ral Linear_ Regressio— • 

b e normally distributed with 

s a .nA \ 


E(X) = x£ + Tx 


(4.28) 


C °V(X) = a 2 1 


,-way 


classification 


0 urce s 

ie tween 
Levels of 


3etwe e * 

Levels of 
3 

Error 


of P r °' 


cova ^ 1 


,duct s 


ance 


'T^us ted SUm 

Ad i scta§£^- 


- q< 


i-l A x* 


j-l 0 XX 
(1*1) 

*{J-1> E xx 


T-l «1 = * 4 

= 4t 

U-l- J E XX " E tt 


i£=i=£ y 4 

I 

xj-i-j q.I 

‘ J -!> Zl 


f + 


IJ-2 


rotal 

Error + 
levels of A 

Error + 
levels of B 


IJ-1 T : 

.J-J 


IJ-I 


q 3 - < A xx +E xx> 

' A tt +E tt 
<I 4 = (B xx +E xx> 

~ B ^ +E tt 


where X is the design matrix of dimension 1 x p, ! 

T = (t. ,) is an N x k matrix whose ith column 

-i = (t il' *’" T iN ) ' 

represents the N observations on the ith concomitant variable 
~ ^i#. •y^) 1 is the vector of regression coefficients, 
me analysis of the linear model (4.28) can be done in the same 
*y as the analysis of the linear model (3.9) in Chanter 3 No 

new problem arises in the adjustment f P 

in the estimation of naramer concomitant variables 

hypotheses. We shall give onl S IT ^ testing o£ linear 
pcoblen '- 6 oom PUtational aspects of the 




maximum likelihood est . m 

estimate („ hi<?h 


is the same as the 




< cal 
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least squares estimate) of (ft v\ 

. \—*-0 is obtained by 

s 2 = (X - XB - TY) . (5 . X 8 _ _ 


minimizing 


Differentiating s with respect to 

(X/ T) ' (X,T) (|) = (X, T) -x , ' 

or equivalently, 

X'XP + X'Ty = X'x , 

T' X 3_ + T ' T Y = T ' x . 

A 

Let be the solution of 

** 

x'X 3. = X'x 


— aru ^ X we obtain , 


(4.29) 


(4.30) 


ignoring the concomitant variables and let j}. be the solution 

of 1 


(4.32) 


X' Xj5 i = X' T ± , i = 1, ... ,k . (4.31) 

Define the residual sums of squares and products by 

R oo = ( .s - x io } 1 ( £ - x io } = s’* - i 0 x ’^ 

R ii = ( -i - x h) ’ ( ^i ■ x ii } = 3L[Si - liX'T., i = 1.k, 

R oi = (X - xjy ' (T ± - XB.) = x'T ± - £ X'T., (4.32) 

i 1/ ■ • ■ /k/ 

^ _ A 

R ij = - x3 ± ) ■ (Tj - XBj) = X'T., 

A 

From (4.29), multiplying the first equation by and 

subtracting from the ith component of the second equation we 
obtain 

R li Y l + *** +R ki Y k = R oi 7 i = 1 '*'* #k * (4.33) 

Let j = (y ,...,y ) • be a solution of (4.33) . The solution for £ 
1 k 

sa tisfyi n g (4.29) is seen to be 


A A A A 

= ~o " Y 1 “1 " ’•’ - Y k-k # 
v hich gives the adjustment to be made in J^-, 


due to the 


(4.34) 


the 
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concomitant variables. Under , 

squares R Q ^ is given 

2 • / yB TY) ' (x - ^ 

r ^ = min (x - X_p - J-JL^ 


residual (error) sum Q f 


ir 


£2 


= R 


oo 


k 

- £ y 


i=l 


i R oi ' 



where r = rank (X) . Now to test th e 
with N-r-k degrees of freedom/ ^ fchat ±s l'J 3 = where 

hypothesis H :L^B = 6^...,^ “ k' (0 .,0.) ' is a known 

x _: /t T > a o x k matrix and £ 1 k 


L = (L x .L^) is a P * 

vector/ we have to find 


= min ( 
^ L'0= 0 


x - XB - Tx) ' (x - X3. - T l) 


(4.36) 


Let R' # 
oo 


R 1 . z R. 
oi 1J 


. -,„ =1 slims of squares and products as 
be the residual sums o-*- 

restrictions L'3 = JL and let 

in (4.32) when 6 is subjected to restri - , _ _ 

.be the solutions of (4.33) using Rlj instead of R.j, 


1 

then 


R 2 = R io 


k 

_ S 

i=l 


YR' • / 
1 oi 


(4.37) 


and the hypothesis H^L'0 = £ is rejected whenever 
N-r-k 


0 ) = 


2 2 
R 2 “ R o 


- F l-a,r,N-r-k' 


r N-s-k 

where s is the rank of (X) and r is the degrees of freedom of the 
hypothesis L = _0 • 

We have thus far discussed the analysis of covariance of 
one-way and the two-way classifications and the regression model. 
The analysis of covariance of some specific designs will be given 
in Chapter 5. The analysis of Covariance of higher-way 
classifications and other designs are straightforward extensions 
of the analysis of linear models (Chapter 3) . In concluding this 
section we remark that in analysis of covariance the concomitant 
variate should be carefully selected and the results should be 
interpreted in the light of the particular concomitant variates 
used and the particular values of the variates obtained. 




ANALYSIS OF COVARIANCI 187 


Snedecor (19 34, 1946) and Coch 

ra l examples and discussed the BliSS (1948) P res ented 

^ses. Wishart (1936) obta ine a fiance 

{ naly sis of covaria nce. The reader • significance in 

M. jg 4 4), D e Lur Y (1948), Finney (i 946) ^ t0 W±lkS (1938 ' 

__ 1 us ’"' * Fisher. (194Q1 


and Nair (194 2) for relevant results 
analysis* 


Fisher. (1949) ,^ Federer (1963) . 
and discussions in covariance 


4.2. EFFICIENCY, OPTIMALITY OF DESIGNS 

When an experiment is laid out to estimate treatment 


estimate treatment effects with 
a certain nuirber of restrictions on allotment of treatments to the 
experimental units it is possible to estimate how much information 
would have been lost by using a smaller number of restrictions. 

Ihe efficiency of a particular design of experiments relative to 
another design of experiment with the same treatments and 
experimental material is defined as the efficiency of the estimate of 
any treatment in contrast from the particular design relative to 
the other design considered. In early stages of agricultural 
experimental on only a few experimental designs were at one's 
| disposal and efficiency was taken as a desirable criterion for 
p choosing the needed design for a given purpose. 

1 Now-a-days an experimenter with modern statistical techniques 
at his disposal can use a variety of experimental designs for a 
gijfen purpose (we shall discuss a large number of them in 
Chapters 5 and 6) . Hence, whenever the. conditions of an experiment 
allow the possibility of simultaneous existence of a number of 
^perimental designs, the question of selecting an appropriate 
design, a design which is easy to analyse and which has some optimum 
properties, naturally, arises. 

Our purpose here is not to state the problem of optimality of 
designs in the most general possible abstract setting nor to apply 
the discussion to the more complicated problems. Our primary aim 
is to stress certain ideas and principles which are most easily set 
f ° tth in simple situations. We shall be concerned with verifying 
Mlether or not a given design satisfies certain optimality 
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f Raid's decision 


NAL ySl 


crit^ ia ’ 


ANAL y S lS ° F VA Hn *. *pi rl (19 58, 1959) , Kiefer 

our t0 le gant =* lete *£ 

*-*v j a «*• ele8 

:rS^ itZ (19 rII ally aiotr^utea N-vector 

t,is r“P ic - x ) ' be a n0 

let S - . P (4 '.38) 


distributed N-vector wit ^ 


L et X = v "l r \4 

E(X } = X d- ^_ r , x, is a 

ah , m c omp° nents p i d 

cov(x> = with unkno design d and which is 

where £ = (3 1'"" , p , 4 ,h depends on _ bv A the set of 


where £ - J hi ch depends Denote by A the set of all 

rrJ l^ice by the he ' exp eri-nter and by A r (the 


kn ° Wn I to choice by the e*P«~ eXperil nenter and by A r ft 

Ixpet|i Cental designs ^ the class of all probability 

class of all randomized es ^ 6 is used by choosing a d 

A. A randomize th e selected 


meas ur es on 


A. A ran' 


from 4 according to this 


, is measure 6 and then using 


desi gn. 


, Y » , fx ) the information matrix of the 
shall call A = (X d ) > 


design d. 


A-optiim. 


= Lie, i _ i/• • */S/ 

i —i— 


where L. is a p-vector of known constants. Suppose that we are 


interested in statistical inferences concerning <j) ., j = l,...#s. 
Let A 1 be the class of designs in which the $ , are estimable and 
let o v d be the covariance matrix of the best* 1 linear estimator of 
the <frj for a d ini', For testing the hypothesis 


H 0‘ ^1 ‘ = 0, 
using a design d in 
function of the test t 


“ ' let %<« Z ) be the 

♦ over an alternatives 


infimum of the power 


j=l a' 


= 6 2 


Eb 

»ith ^ 

x, ' 


an d let F(d,«S 2 , 
size tv 


°0 be 


the £ 


u premum of B ,* 2 V , 

) over all tests )J> 01 







Defi nition 4 ^1. <M-opt ilT11 x 
Is said to be M_ r ■ — * 




^optimun, ln ' ; J° r ' 

r f(d *'*’ = s d “ P A * (a ' {2 - «> - 

sjf 

T nftfinition 4.2 . 
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0 < 


89 


a < 


1# 3 d esi gn d * 


/ 


) 


in A if 


A desi 


(4.39) 


gn d* is 


lim 


2 

6 0 


~B (d*, 6 2 , 


a) - 


sup (3 (d, 6 2 , a) _a) 
deA 


said to .hp t r\ *- ■ 

ue t-o-itimum 


= 1 


D efinition 4.3 . (D-cgti^ / A . 
D-optimum in A if e A 1 and Sl 


(4.40) 


if 


gn d* i s said tQ be 


l v d*l = Inf IV I- 

a deA' d 


Definition 4.4. ( E-optimum ) 

E-optimum in A if d* e A' 


(4.41) 


A design d* is said to be 


Of V 


imum in A if d* e A' and -r-f e 

. _ nd if the. maximum characterise 

r?* less than ojt aoua 1 tr 

d , , . . q 31 tC the maxi ™ characteristic root of 


ic root 


V' d for any deA 


Definition 4.5. ( A-optimum ) . A design d* i 

A-optimum in A if .a* e A» and if 

F 


s said to be 


tr V 


a* = min 
deA 


tr V 


d' 


It may be noted that these definitions do not depend on whether 
0 


a is known or not. 


M-optimality, the strongest and the least artificial among 
the above £ive criteria is extremely difficult to characterize 
even in very simple situations. To understand the implication of 
L-optimality let us consider the following example. 


Example 4.1. Consider a one-way analysis of variance P 

«th linear model as given in (4.38) . Suppose that with ran* o 
- 2, * „ 4 . o 2 = 1 and with the problem of tasting H = W 

Wo ■ a iri t-h two observations fro. 

Use the non-randondzed design interested to find a 

population. Suppose also that we ^ nirIUB powe r on each 
* St size c for H which maximizes th fa[fOUS 

ptour 6 2 + 6 2 = ° 2 - Hunt and St6in " 
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„ , _ Kiefer, 1956 ) showed that the usual test 
unpublished work (s dearee s of freedom does the job. 

based on chisquare with 9 

Let r be the rank of X,. Then, fro ” the 

d . ° r x P matrix L of rank r such r 

in Chapter 3, there exis d 3 only through L_3 whose 4 

that the distribution of X depends on _ r d- e * 

best linear aatinator is given hy where ^ is any soluti 08 f 

of AJ - *■«£.. I* r„ - p then V is the identity netnx. 


Now suppose that we first start with a design d, but due to 
missing observations we end up with the design matrix d' such th at 


A d' ~ ^ X d^' ’ ^ X d^ ^0 


A d , = VA d /, ^ - X Q 2 ' * 

Now, presumably, our equal interest (with respect to 3^, in the 
alternatives 3 ,^+ 3 0 ^ = 0 ^ should not be conveniently changed in the 
way to justify the use of X test based on 3 d + 23^ ' P Y the 


way to justify the use of X test based on 3 d ^ + 23^ ' the 

same accident which reduces us from d to d* . But if we want a 


size 


a test which maximizes the minimum power on the contour 

r\ r\ 


2 2 2 

3j + 3 2 = 6 then this test no longer satisfies this criterion 


and should not be used. 


2 

.In the above example if we let 6 tend to zero and rephrase 
our criterion in terms of the derivatives of the power function 
the problem reduces to that of finding a regular Neyman and 
Pearson's type c test (Neyman and Pearson, 1938). i n the case of 


tha design d, the appropriate test is then the usual one based on 
the X -distribution while in the case of design d' rejection region t 
is or the form a 3 V 2b6 2 >• ! where the positive constants a, 


b can be obtained numerically (see Kiefer, 1959). Thus for 

non-randomized designs if we want to find that a 

associated test which approximately maximize the ^ ^ 

small spheres about the null hypothesis (6 2 o» “ ^ “ 

that a regular type c region is to be usla C ° nClUd6 

chosen. This is called the L-optimum de whate ver design is 

b-optimality is a local Wr , • esign. obviously, thus, 

local version of M-optimality. 

l^'TnlZlZ Wald (1943) and “ e ia 

6 “ 13 E "optimum f or all >^ of variance F-test of 

F ° r point estimation an E-optim^ 




m 




* si9D TOnlmiZes th * naximum 0v 0F 

the variance of the best i, 61311 U !.. , . s UNCE ' 3t 

near estinaa. *s p Wlth E a 2 

An A-optimum desian ^ tor of j i = l 1 “ 1 of 

wst linear estimator of the avera^^v ' '’ 

parametric functions obtain’ I’l^s a »« thus o£ ance ° £ the 

transformation, a D-onti rom ths * bv nY S linear 

variance of the best linear eE ® the 9 eneral i2 a 

is known, it minimizes the voi at ° rS ° £ the *. ana th 6 2 

x a ne volume of the 1 d thus wh ®n a 2 

re91 ° n ° n *1.*S £or any confiaenc ' “ a£ ^nce 

of hypotheses a D-optimum ae sign P ° r 

has maximum Gaussian curvature at the " , P ° wer £ -ct £ °r 

locally unbiased tests of a given si2e ^thesis, among all 

Because of the ad hoc assumption tbit the F test 
corresponding test for 0 2 known< ^ ^ «-J* «- 

D-optimality and L-optimaiity .although loca i properties s _ _ 
reasonable than E-optimality. 

D-optimality has an appealing invariant property which is not 0 

possessed by any other criterion. Let . be related to 

, <P S by a nonsiiigular linear transformation. Then if d* 


ways be used, 
properties seem more 


is D-optimum for . 


, <p then it is also D-optimum for 

o 


99*. For testing of hypotheses D-optimality is also 
invariant under the group of nonsingular transformations. Kiefer 
(1958) , by means of examples has established that E-optimality is 
the least satisfactory of the criteria considered here. He. has 
also proved that if r d = rank of X d , is constant for d e A' and 
if d* is D-optimum and is a multiple of the identity 

matrix, then d* is E-optimum and L-optimum. 

For some other optimality criteria in testing if ^theses 


reader is referred to Giri 


and Kiefer (1964) 




3E 4 - 1A m 

h first lactation yield 

e following data relating t sires are taken from the 

units of dams and daughters o± ^ of Agriculture, 

of Kankrej herd maintained a IndeX of fi eld 

India (Reproduced from the 




f 

f 


v aS^ CE rie ul*‘ 

— 8 . 

■ ri rla‘p^- i0n ° ..laid O* «"* 


Table 4.3. J^ffere^^ 


|ANC e , cU 1«i 

of Agr na s) . 

JJ.D a5/ 

and daughters 


for 4 


Sine— 2 - 



Dam 


n'S yi©ifL 

Daughter 1 s 

153 

112 

276 

269 

260 

24 7 

299 

198 

298 

357 

266 

327 

281 

200 

18 6 

190 

188 

261 

168 

235 

251 

165 



261 


234 

190 

fiire-i 

123 

274 


313 

375 


374 

312 


320 

229 


532 

171 


203 

197 


269 

226 


153 

260 

166 

252 


228 

184 

383 


340 
251 
169 
208 
182 
221 
19 3 
312 
299 
188 


Si re-A 


340 
24 7 

193 
235 

252 • 
280 ' 

194 
169 
24 2 

253 


* *rr- data ^ *»■. ■—— 

Molds for djf thesis of eqwiity of / 38 Conoom itant varia) 
2 _ Ihe fo ;;— -es. daught ers i mean lactatii 

“ 819ht «M qoowtTr J 3 <Wisha « , 193e) 

Sex and ^ Of food Pigs <;^ t h e initia 

cc °rding to pen 

& 


UK 








4.4. Initial weight and growth rate in lb per week 
for 30 different pigs 


9.67 
* 8.8 2 
8.57 
9.20 
8.76 
10.42 
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9.94 


10.00 


9.75 


9.11 


8.51 


9.52 


9.24 


8.66 


9 .48 


8.50 


8.21 


7.63 


9.32 


9.34 


8 .90 


9.32 


Growth rate per weex 
(in lbs) 
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„ H,ta using the initial weights as concomitant 
to test the hypothesis Of equality of mean food 

effects on pigs. 

3 . In a simple linear regression of Y on X, suppose n points of 
X have to be selected (without error) over [a,b] and then values 
of y have to be observed corresponding to each selected x—point ; 
order to get efficient estimate of the regression coefficient 3 
in the linear model 

e(y|x = x) = a + 3( X -x), Var(Y|X = x) = 0 2 , 

v 

where x represents the mean of the x-points selected. Prove that 
an efficient design in the sense of minimizing Var(b) , where b 
is the usual least-square estimate of fi is provided by the 

zrz - the - ~ -*- - 

“ e ^ *> (-b) /2 . ttote ^ :rr e r t ? as above and ° ne 

different from each other.] ° f the x ' s must b « 
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o£ *^7* eSe Chapters 3 and 4) . First ^ 
t,Pe /^ 3 <r designs, then we sh all 

a n ° , iiH ea DJ ' 

, „s»s tal .-..ict 0,np . ,^V designs. 


in use in experiment^ 

* involves the linear 


and irie ar , et e 

b loc* designs- 

c-* 4 * *! of i” c0 ” P .xperi^nt is to know somethin, 

tbe ^ . , scienti £l ° _ tment s and to prove or 


known af<> ut ' ' boUt tha tn^-- 

nnypctAesns »° , g ^ for a group, a 


sci ent 
* a 9 roUp 


of a „. m of trea 


-treatment effects. The 


Cultural experiments one uses a plot of 1; 
once a sufficient number of experimental , 


it q ne 

is « 4e neral experime ntal unit will refer to the 
„ „r a class- The t< f“ th , t is being used to receive the 

£Tf e*P^” tal ^"Lnts under consideration. For 
application of ^ A in investigating the effect of 

“® ple ' if T Von potted plants, the potted plant will be the ■ 
various te&i lZe , _i experiments one uses a plot of 1« 

r:r--:.. —»- —-, 

units has been selected, it is then necessary* to allocate the treat® 
nents to the experimental units in a random fashion. This random f 

i 

allocation is necessary to protect against any bias in the 
experiment resulting from the influence of some extraneous unknown 
e affect ^ n 9 the experiment. For example in agricultural 
With regard'tQ f soil eXperimental (plots) are not homogeneous 

®sponses for treatment ' W6 are sure to get different 
treatments are allocate* 5 ' th€y * re alike ' if s P ecifiC 

^ Uty char acteri s ti cs t0 ^ perimental units with specific 

vaUdVr^’ «“*»’ano^Ir 13 SOme am ° Unt of repUOati °" 

,„ lstic statement- „ n also enable s one to attach 
^ 9) ha s stated thaf S ^ estimate d treatment differ^' 

° ° btain a vali d ^^^^ion and replication are 

^ of error variance. 
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Design of experiments is th 

placed at random on the e x perilne * t "" thoa *» «hich treatments are 
responses are estimated „ lth thfi ut " ltS in such a way that the 
f the randomization process is such that precision possible, if 
an equal chance of receiving each trea SVSry e ^P er imental unit has 
randomization. However, du e to tment, we call it a complete 

. • T " ne nature of 

restrictions on randomization ma b h experiment, 

facilitate a gain in precision Y & inherent in the design to 

Most experimental desiqns • 

experiments are block designs. The 

(plots) ara grouped in blocks of th We ex Perimental units 

number of experimental units LTn "" ^ iS ' 

fertility characteristics L e ^ 

e rert ility characteristics of 

different blocks are, however, usually different. If the size 
of the block is equal to the number of treatments and there is a 
random allocation of each treatment in each block, we call the 
design a complete block design. However, iT~-the number of 
treatments is so large that a full replication in each block 
nakes it too heterogeneous in fertility, it may be advisable to 
ji 1156 smaller, more homogeneous blocks not containing a full 
> replicate. Experimental designs with blocks containing an 
i incomplete replication of treatments are called incomplete block 
1 designs. In the category of complete block designs we shall 
consider randomized block and latin square designs, and in the 
category of incomplete block designs we shall consider lattice, 
balanced incomplete block and partially balanced incomplete block 

designs. 

5 *1. COMPLETELY RANDOMIZED DESIGN 

simplest of all designs having a random arrangement is the 
completely randomized design. It is defined to be one when a 
gr °up of v treatments are randomly allocated to the whole set of 
^Perimental units, without making any effort to group the latter 
*ny way for more homogeneity. 

k We nurrber of replications for difterent treatments need not 
be it may vary from treatment to treatment depending on the 




AX * 1 




j=l,,..,N^, i = denote the individual measurement of the 

jth experimental unit for the ith treatment and let 


x. = -t_Ex v 1 „„ 

l v- N i j ' x ~- i l] V 


“>™Hy ^strib!ted I1 inde V;ar a ableS ^ ^ aSSUmed t0 

E(x , Pendent random variables with 

E( V * p + « A 

Var(X ij> * ° 2 (5.1) 

**•'» *• the 0VeraU 

9t ‘ Sf ^ ¥ #1 . 0. “i ls «>• ith treatment effect 


7 = 0 t 1 w v 

1 1 Inwha t folUs , 


in this chapter we 
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the term measurement in the general 

/-mint or a 1 Se indicating that it 

be a count, grade or a measurement of a certain , 

cliara cteristic dependent on treatments. 

we shall use "S.S."to denote sum of squares and "d.f." to 
denote degrees of freedom. The model (5.1) corresponds to the 
Unear model of the one-way classification of Section 3 and the 
ANOVA table for testing H o : ai = ... = % = 0 is given by 

^_Table 5.1. ANOVA for completely randomized design 


Sources of 

va riation 

Between 

treatments 


S.S. 


q,= UN. (x -x ) 2 a 2 - 

1 • • • 


Expected mean 
S.S. _ 

1 Y M 2 

T~T I N.a. 


V— 1 q, 


Error 


Total 


q 2= l j S(x iJ- X i. ) 

ll (x, .-X ) 2 

ij •• 


From Section 3.5 the uniformly most powerful invariant test 
of H q (of level a ) rejects H Q whenever yfy ^ q l /q 2 ) l F a/ v-l,N-v* 

If, in practice, is rejected at a given level of significance 
oi, one has to use the multiple comparison methods of Chapter 3 to 
divide the entire set of treatments in different groups such that 
the treatments in the same group have the same treatment effects 
but the treatments in different groups have different treatment 
effects. The analysis of covariance of this design is the same 
» that of the one-way classification discussed in section 4.1.1. 


5 -2. RANDOMIZED BLOCK DESIGN 

The main disadvantage of completely randomized design is that it 
16 “sually suited only for a small number of treatments and for 
hoi »«eneous experimental material. In any experiment where a 
lac 9e number of treatments are used, a relatively large number of 
Omental units are required. This generally increases the 
?“ Ct *i°n among treatment responses and the completely randomised 
**** is not very helpful. If the whole of the experimental 













as 


' jjf 




posS ible to group 
it be * the units 

oinoge De0 “* eaC h such than units in 
of V Ther m° re the randomization 

ne aD ° bJ0 oK deSia t a within each 

^-rr/; iay ° uts - the 

s—t: <;r^ - ^.* - 

is are b •«* v leV els a » d ^ desi gn if «ithin 

««*' being treatments don ,ized b dom to the v 

Hayout is callsda at » y , ways o£ 

IcTbleek the a way that ®^b^ same probability of 

experimental ^ to the units h nts in different 

assigning the treatm and the as ^ Cox (1951) 

be '°l Statistically independen was originally 

ra table of ^f^Tn^ become popular in a large 

developed by Fisher (1926' 

number of field experiments. 

_ j . T.ot - X. ■ a 1 !#•••# 


itiber of fiexa 

: a ;;;'in the ith block. The corresponding random variable X,. 
u be assumed to be independently, normally distributed with ~ 


j> - » + b i + *j 
x.j) = o 2 


(5.2) 


1 *■ 

j) - « 

\i is the overall mean# t. is the jth treatment effect 

ying Z t = 0 and bi is the ith block effect satisfyii 
J J 

0. The hypotheses of interest are 


(5.3) 


contest of randomized block desinn n 
■ Sls is H t . However if h • 9 he most meaningful 

:e ttlat blocking is not ? ” aCC6pted «»« it gives 
^-t 1£ylng this with for future experimenta- 

lon per cell the anova tabll T " 7 CUssif Nation with one 

f0t test *ns « m and H _ is 
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—T “—— -____J^domi 2 ed block a 

ce of d f -- ck desi g» 

ati °2 —U- - s - s - -- 


ICS b -l q 1 =2v( x _ x .2 

i 1 - 

Treatments v -1 ( ^2 = (x -x ) 2 

j • J •. 


a ' + brr> Evb i (v-i) 2 i 
i q 3 

fj2, 1 v L 2 3? 

J J q 3 


Error (b-l) (v-1) q 3 = Z ^x^-x^-x j+x ) 2 

Total b v-i Z (x. .-x ) 2 

_ i j 

The hypothesis H b of equality of block effects i 
level of significance a if 


is rejected at the 


(v-1) -± > 


l-a,b- 1, (v-1) (b-l) 


(5.4) 


and the hypothesis of equality of treatment effects is rejected 
at the level of s-ignificance a if 

q 9 

(b-l) —- > F. . , 1W , (5.5) 

i q 3 —- 1-oc, v-1, (v-1) (b-l) 

If, in practice, H T is rejected at a given level of .significance 
a,one has to use the multiple comparison methods of Chapter 4 .to 
divide the entire set of treatments into different subgroups such 
that the treatments of the same subgroup have the same treatment 
effect and those in different subgroups have different treatment 
effects. From Section 3.7 it follows that these two tests are 
best invariant for the corresponding problems. 

5.2.2. Missing Observations : Allan and Wishart (1930) were the 

first to present formulae for estimating one missing observation 

the randomized block design and in the Latin square design (to 

* discussed in 5.3) . Yates (1933) showed that their formulae 

r cr ma res in ths ANOVA 

insulted in minimizing the error sum missing observation. 

tsbl e for the randomized block design wi several 

then presented.an iterative method of estimating several 


- 


mgm 


. v/C I Q OF VflK' 

„ block designs. »• also showed 

observations in randomised hypotheses in 

missing obser £or testing is still valid 

that the ANOVA missing obser , 

randomized block design wl * h ” observations is not large and 

.provided that the nu^er of^ £or the error sum of squares 
that the number of degree of f ^ , s redu oed by the number 

for the ordinary randomized M ^ or , block is 

of missing observations. In ^ esti|tiat e it and the 

completely missing ^ entlre iy the affected 

analysis should be earn 

treatment or block. . of jth treatment in 

Let us assume that the observation x.j 

the ith block is missing and let 


T! = 2 x ik ' 

1 . k IK 


= Z 
£ 


x flJ , T 


= Z Z 
K 

£^i,k^j 


Prom Table 5.2, the terms containing x.j in the error sum of 
squares of the ordinary randomized block design can be written as 

(T’ +x,,) 2 (T! +x, J 2 (!■.+X.J 2 


Thus for variation in the error Siam of squares ia minimum when 


(T • +x. .) 


(T! +*. .) 


(T ,+x. .) 


giving 


!P| 

V, 


bT ! + vT 1 - T ' 

1 . »j , 
(b-1) (v— 1) . 


(5.7) 


This value of x ±j is taken for the missing value and the 

approximate ANOVA table is completed in the ordinary fashion 
S except that the number of decrees n-F 

I s g°ares is reduced by 1, T ^ t f “ ’^ " 

r - -nance is known ™ ^ 

expected value of treatment mean square S6nSe that th6 

velue Of the error mean square under H tha " the eXpeCte ' 

fr ” «*• approximate analL, " “ ^ t6St ' 

Y is of variance table accept £ 
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- "° t0 C ° mpUte 6xact test 

placing in the approximate P-statistic T 1Ch is obta ined 

of sabres by the following accuratl tUt" aPPr ° Xlmate treat *”t 

atment sum of squares 

curate treatment sum of squares 
2 

v T' 

1 til k=l X * k ~ )_(error sum °* squares calculated from the 

tfi, approximate analysis of variance table) 

9 rn^ m* 2 2 9 

V + - + -^ + ^t +j ^ + -*4-) 


T 1 T 2 


A simple method when there are several missing observations 

is to replace as a first approximation each missing observation by 

the mean of the block in which it lies. The formula (5.7) is then 

used to get the second approximation in order, utilizing either 

the block mean or the new approximation if it has already been 

obtained for all missing values except the one being estimated. 

Such a procedure converge rapidly in general. If x. . is missing, 
a 1 j 


the estimate t^ of tj is given by 


t. = A (T 1 .+x. .) 
j b .J ij 


= £ [TV + 
b .J 


vT .+bT! -T' 

• J i •_. 


(b-1) (v— 1) 


(5.8) 


Thug the variance of the estimator t. corresponding to the 

A -* 

estimate tj is given by 


Var( V = iettt 


[1 + bi^iT ] 


(5.9) 


= V C1 ■^TTT^T ] 

^viously the variance of the estimator of any other 
effect t k without missing observations is given by 


Var (t J = Si 

k b 


F Urth 


>ermore for any treatment contrast ^ a j j 


,t, with . I 


:e the ith treatment has one 


missing observation the 


iance of the estimator 1 a-t. 9 iven ^ 
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<6 V \ 

2 2 2 & _ (1 + - 1 ““ ' 

Var< * ^ a J + ^ b 0-« 


(5.10) 


i consider Problem 19 of Exercise 3. i 
5 . 1 . a worked out ExBm Z-2- t he experimental layout of thi s 

”* follo, ' ±n9 diasram 11 trea tmente numbered 1,2.9. 

two-way classification with 9 treatme 

The treatments are randomly assigned to experimental units within 
blocks. The corresponding yields of rabi crop are given below the 


S our ceS 

a bi <><** 

a 5 


treatment number, 


Block 1 


Block 2 


Err° r 

Total, 


5.2.3. 

randomi: 


5.50 5.62 5.12 5.00 4.00 3.50 5.50 6.12 5.00 4.75 


two-way 

K, ■■ 


4.1, thi 


%. j-, 

of the : 


[•] 
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variance 


I sources ot 
jt yariati ° n 

*** Blocks 

, treatments 
Error 

Total_ 


S.S. / 

Mean s.s. 

3.0131 

1.0044 

5.9305 

0.6589 

10.9553 

.4057 

19.8989 



1.62 


5 . 2 .3. Analysis of Covar iance: The ana lysis of COV ariance of the 
randomized block design with one concomitant variable is that of a 

” C aSS : fiCatl ° n With »“ ° b —tion per cell, discnssed in 
4 1 the levels of the factor A being blocks <b=I) and the levels 
of the factor B being treatments ( v = j) . 

5 - 2 * 4 * -M ndomization Model of th e Randoms Block np B i r 

(a) Est imation . In the linear model (5.2) for the analysis of the 
randomized block design we have assumed that the errors in the 
observations x ±j are independently, normally distributed with the 
same variance a . Here we shall deal with a linear model 
(called the randomization model) which does not make any such 
assumptions about errors, but takes into account the actual 
randomization employed in assigning the treatments to the 
experimental, units. We first formulate such a randomization model 
or the randomized block design and then discuss the problem of 
^timation and testing with such a model. Randomization models 
Wfire first formulated by Neyman (19 23) for the completely 
tdomized block design, and later (Neyman, ( 19 33)), for the 
a * d omized block design. 

t r Cons ider the set up of a randomized block design where v 
ar e trnents are compared on b experimental plots and that vb plots 
\i gr ° uped into b blocks of v plots each such that in each block 
bl 0C k Inents are assigned to the v plots independently for the b 
th e S ° ^ at (vl) b assignments of treatments to units have 

11,6 P r °bability of being used in the experiment. 





■r- 


wm 


3k 


WEflP „ denote the observed and the true 

«<= W H*n plot k in the 1th block. These «k 

with treatment j ^ ^ the observed and the expected v H 

fir quantities are teg satisfying 


m 


^ . - r p ^eciiirciGCi 

quantities are v satisfying 

I ' L random measurements X ijk 


Jf -V ■ . 

1 E(x ijk ) = M iJ k # * 5 *1] 

_ i .be Let 
= v ' X " . . 

x ijk * P iJ k + @i J k . (5 ‘l2 

e . are called technical errors of measurements and ar e 

regarded as random variables with 

E(e ijk ) = ° * . (5.1; 


E(e ijk ) = ° * . (5.1; 

Using the usual dot notations for means (Chapter 3) We ca n w 

“uk '"'i'V Y u + "u* ’ 

*uk = , ”i'V r ij* n ijk + e uk 
where 

M = V , a. = v. - y i t, = u 


a. = u 

i i.. 


T ij =1, ij. - V. - “.J. 


- U , t = u 
• • • J 


"ijk " “ijk “ P ij. • 

The quantities h. are called „.i. 

1Jk re called unit errors (unknown constants) 

Obviously we do not observe the concentua, 

«~ely we observe the measurement x. . of the^eT^^ ^ 

ith block. From (5.14) „e can write 1 " 1 at »ebt j i„ tl 

X iJ = 11 + “i + tj + T + n. . + e . 

with 1J ij (5 

i “i " °« £ t . 0, £ T . - 

J J j Y ij " 1 Y ij = 0 , 

n ii = n J 1 J 

k ^ k n ijk ' e ij = Zo e 

, 2 k 1 ^ k *-J k (5 

where 

ndom tables 6 

'l. If the Jth . 1Jk * fle£ined ^ 

“ijk = . ilh block, eatlrent occurs on plot k in the 


ot herwis e . 


(5. 
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The j° int distribution of <5^.^ is completely determined by the 
randomization described above and is independent of the 

distribution of the e ijk . 


LeHSS. 


E^ijk 5 = V 


(ii) E ( 6 ijk 6 i'j'k' ) * 72 ' if i ^ i' / 


(iii) E (5 ±J]c (S i j , k .) - 


v -1 6 


kk' ' 


if j = j* , 


v 1 (v-1) 1 (l-6 kkl ), if j ji j' , 


v/here ^ kk i “ i • if k — k' and is zero otherwise. 


Proof. 


u) E(« k ) = P« = l}-=i. 


(ii) Since for i ^ i' the randomizations in the ith and the 
i'th blocks are independent. 


E (6 6 ) = p {6 6 =1} 

M ijk i'j'k 1 ' ijk i'j'k* J 


jk i'j 


" p{6 ijk lf 6 i'j'k' “ 


= 1 } 


= —r if ( i ^ i') . 

■i » 


(iii) E (<5. 6 ) =P{<S.. 6..,. = 

ijk ij 'k' ijk lj 'k ' 


= 1 } 


= P{ W - 1|6 ijk - 1} p{s ijk ■ 1} 


V- 6 kk , . if J = j ■ > 
v' 1 (v-l)' 1 (l-6.. ,) , if j * V 


Since Z t\. =0, from (5.16) we get 


Q.E.D. 


E (n ij ) = o 


(5.18) 



.‘Ji 




*.« 


w*: ■ 

rWj «* 


QP VARIANCE 


.: J,'K' 



m*e analysis or 

ssfv y a re 

the * .v and the 6ijk 
5 »v since the ijk 

(5.16) we get 


independent and E (e^^) = q 


For any 


contrast <p 


% c t. with l c j - 0 an unbiased 
- , j J j 


estimate of <t> is 


■ * c j x -j 


fL (5.15) , (5.18) and (5.19), we get 


E(X ,) = P + t j * 

though theoretically the Var(^) can be computed from the moments 
of 6 . . as given in Lemma 5.1, it is complicated and involves 
the and the variances and covariances of the Some 

simplification is introduced by assuming complete additivity in 
the linear model which we will discuss in connection with the 
efficiency of the randomized block design relative to the 
completely randomized design. It is not possible to find an 
unbiased estimate of Var(<|>) without introducing some such 
simplifying assumption. 

A 

An overestimate of Var(4>) can be obtained by first estimating 
$ separately in each block by 


<P. = I c .x. . , 

1 j J iJ 


i - 1, Then finding 

s = bTI z (<P ± - <t> ) 2 

i 

A A 

where <p = b" 1 I <p 

• i 

Lemma 5 . 2 . 


(5.20) 


( 5 . 21 ) 


E ( 5 “ ) = b“ * (b- 1 ) “1 v i 2 * 

b \o i) i x t + Var(<J,) 


(c. 22) 
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<f> + 'K + f. . 

1 1 * 


J ij 


yroin (5.18) - (5 . 19 , we aet , 

are associated with diff„ V = 0. 

different blocks are mutual!* bl °° ks and the rlT*' Since the 

f .1st. UtUall y Independent "/"“^nations tn 

• b i 1 ' from (5.231 „..v 


«♦* - t> 2 = * + f..f , 

i i 1 i 


Thus 
E ( Z(i. 


~ + I (f,. f ,2 

1 i 1 . + 2 E- (f f > A 

1 1 • i 

k-*.' 2 = j if+ZiZ U .-t ,2, 

1 i 1 • 

2 + E( ^ f 2 - b f 2 ) 
i • 


- 1 \ + 2 Var (f.) - “ Var( z f ; 

i i 1 b . i 

x l 

= Z + z Var (f ) (l_b~ 1 ) 


«v||f 


= Z A i + b 2 (l-b _I ) Var ((j>) 


as from (5.20) 


7 A - 

^ - <f> and Var (<f>) = —r £ Var(f ) . 

• “ j * 


b i 


From (5.24) we get 


(5.2- 


E( f* ) = b ” 1 (b- 1 ) ~ 1 Z A? + Var(J) . 

i 

r ? ai l the block-treatment interactions Yjj are zero ' 


Q.E.: 


** a11 the block-treatment interactior 
8 ^b i s an unbiased estimate of Var(<J>) 
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„ nd the mathematical expectations of Q 1 /( b - 1) 

Let “ nOW / “ " defined in Table 5.2, under the 

Qj /(u-l) and 0 3 / shall ass ume here that the e are 

randomization model. 

uncorrelated. 

v . ) = o for all i,'j * nd the 6 iik are 
Now/ since E(e ij ; - u - - 1J 

_ _ we Qjgt from Lemma 5.1 

independent of the e ijk # W 9 

Covle.j.e^j,) =E(I « ijk e ljk - ^ «i • j -k '*1' j 'k ■ ’ 

= £ ^ E(6 iJk « i .j, k ,)E<e iJk e i , j , k ,) 

k 1 


- E £ ^i^jj'^k' Var(e ijk ,E(6 ijk { i'j'k- ) 

k k' 


= 6. . . 6. ., v" 1 E Var(e. ) . 

ii' j j • - 'nk 


(5.25) 


(5.26) 


From (5.25) we conclude that if the e. .. are uncorrelated, then 

' , J- J A 

the e^j are uncorrelated. 

r _j_ n 2 _ 1 - ,2 2_1 v .2 

Let B ' fc^l Z b i' a T " vU Z b j ' 

J J 

_ 1 n . 2 ‘ 

BT lb-1) (V-l) “ Y ij ' 

°e = Z J var < e i; j> 
l j 

= -J- 1 £ 1 2 

Ub 1 J * iJk ' (5.26 

ct 2 _ 1 „ „ alB „ 

u J * ^i.k^x..' 

"Blfcir J k e ik ' (say), 

b ( v -l) 2 i j k <P ijk- JJ i.k-> 1 1 j , + (jj. )2 

o 1 • • 

1 

TU = --—ar Z 2 V v 2 

b( v-l) 2 i j Zv ijk * (say). 


° an a fter S J ^ 


—-x cai errors are 

str aightf orv , ar^ k,,±. j_ 


# 


“i^b 
Br th. l) 

i 


d ' then 


(5.26) 



E *PER| 


Me NTal 



filiations the foli 0wing ^ 


e %na 


L ermna_5^1j 


i = v a 2 


" AN *I-Ysis 2|1 


,7^ = V0 B + °* v + 0 a ; 

/ * ( V^’ = b °T + °U + ' r V 2 ) 0 2 2 

t °3 . .2 2 TU ^ ' 

I B( (b-1) (v-1) ° BT + °U + v ~ 1 (V-2) a 2 2 

o TU + o 2 

I Fr om Lemma 5.3if a =a 2 e 

T BT = °- Then 
Q 2 Qi 

I b( ^rr' ) = E ( --—-) 

(b-1) (v-i) 

I even if £ 0 . 


(b) Testing. (Permutation test) tt , 

--• Under th* 

exact tests of hypotheses in th* randomization model 

rne randomized blorv ,w. • 

whenever the joint distribution of th* k Sign exist 

rr is r ins — - -tzzzzzz 

«the observations. onso1 


WS Sha11 flrst give the general idea of the permutation test 
K a hypothesis H. Permutation tests are constructed when the 
^distribution of the random observations is invariant under a group 
/ permutations of these observations. Let X = (X^,...^) ' be a 
I vector of random observations and let x = (x^, .../X^' be one of 
| its valNjes. Suppose there exists a group G of L permutations of 
the N components of X with the following invariant property. 

I Let Six) = {gx: g e G> be the set of L sample observations 

derated by x, applying the group of permutations G to 2- * • 

I ro . “ . _ . _ iven the condition that 

, q E Ulred Sy,rmetry pr0perty 1S . “ ' ® under the hypothesis H that 

I *» e 3 (x) / the conditional probab Y ^ i/l for 

I ^ ta ^es on any particular one of the L v ' all values x of 

! Of the L^elerrents inS(x) and this holds 

f andom vector X. , . nni ficance o is 


* Permutation test of H ah 


level of sidnificance 



analysis OF 


ANCb 


byaS ubsetS'(x) of 8a ^ 
S( * )/ equal to oL and the 


ibl e sc or eq uaJ - — e 

defined, « 1S ^^if the 2 

whose nu*er of • lf and on ■ * defined by means ot „ . 

hvoothesis H is - _..„ =et s' <*> lS 9 test statistics 


hypothesis 

is in S'(J0 


is in S'(>0. situatl ““" ployed in the analysis of 

-* *° *Z action of normality and 

eOMide e technique with the ad ough treatment of the 

r^ndence of £ referred to Scheffe 

ssr;s r — 

, an one-^ay 

EaamEitil- consl f populations corresponding to two 
observations from each o ran dom observation from the 

treatments. Denote by the J 

ith population , _ 5 

- 8 x = 7, x „ = os x 2 3 ' 

x = 9, x.» = 4, X 13 " ' 21 22 

11 x ) . Suppose that we are 

interested^by^maans^f' sample' observations (9.4,8,' and <7.6.5,' 
in testing the hypothesis H of the equality of treatment effects, 
we shall now find the permutation test based on the statistic 


• itua 


tions 


= 6 , x. 


|t (x) 


3 3 

=1 £ x - I 
j = l lj J =1 


There are twenty different ways of dividing 6 objects into 
groups of 3. Thus in S (x) there are 20 different elements which 
give different values of T(x) . 

For the observed x,T(x) = 3. Under H , from the Table / 

5.4 P{ |t(x) | >3) = .70. 

If we assume that the are independently normally 
distributed and apply the analysis of variance technique to 
analyse the data we get Table 5.5. In both the cases, we 
accept the hypothesis H with high probability. 

moc/lX! m:; t rr ation tests in the rana ° mizea 

normality, independence and depen<i ° n the as s'™P tion ° £ „ e 

which are assumed i n (5 2 ) 6 homo 9eneity of the error varied 

in which the observatio c °nsider the randomized block deS.’5 n 

X iJ on jth treatment (1=1. v) 10 



r- 
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for di£ . 


Table 5.4. 


*i2 /X 13 

- - - ^^Permutations 

- -- 



_ _ Table 5.5. Analysis of Variance — 

Sources of variation d.f. _ sum of square s^ 


treatments 1 

Within treatments 4 

Total_5___ 

the 1th block (i = . b) has the 

'1( * u + a, t t, + y jh + n ii + e ij 


following structure 
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n = E 6 . n. .■ 

ij ljk ijk X J 


= «4 4V®4 


ijk e ijk' 


I t . o, I®i = 0. " w * 

j J i J . 1 

Under the hypothesis H^: t^ = • •• - t v - the distr ibutio.^ 
X. . does not possess this invariant property. However such 
invariant property can be achieved if we refine as the 
hypothesis H^, that there is absolutely no difference between 


v Y = E Y. . = 0 

r« i 


t = ... = t v = 0, the distribute 

.__j ^ Mi. -\r _ 


hypothesis that there is absolut( 
treatment effects in the sense that 


°T ~ °TB - °TU * ° 


the joint distribution of the e^ does not depend on how 

f roa+* n •% .. __ J _._ — -i-\ ____ ‘ _ _ , n 


xj - -‘tuw • 

treatments are assigned to the bv experimental unite; 

“ • Und 


x.j = u + « ± + « iJk (6. k + u lk ). 


' i 

I i 


-here the 6^ are defied in (5.26) and the u play the rol e of 
'ijk whlch ar ® now assumed not to depend on how the treatm 
ssigned to the experimental units. By def inition, E (u ) -o'* “* 
Ut, otherwise the joint distribution of the U. k is arbitrary. 

Let G be the set of all permutations within blocks and tv 
■ e obviously (v-) b elements in G. Let x = ( x ' there 

'•'V' and let - .. x lv. \ v j 

x) = (gx: g e g) t' jj 

■ the set of all (Vi ) 13 e=rn n 

rmutations in G . Let Us f fr ° n * a PPlying all 
ndomization incorporated “ ** 

1,6 the same conditional probabilitv * U samples in s(x) 

in S(x,. obahility given that ^ given ^ ] 

Leb x be the random vector that taw I 

'eriment. Let fc takes ™ the value „ .v- 



° n the value x in the 


= 1 6 . z 

k iJk ik 

" “* “* — var iables takino 


(5.28) 
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*1 


unit 


and 


J 


that is, z ik is thg R'M Ent 

the 6 iJk are defin ea SBr ^tl 
,, ln (5 

and by z the val Ue ^ ’"■' 2 bx ) > 1 

r assignment t raa *»**■*•• o n tn 

randomization deter mines ° t he A 

linear tran sfo r m ati on (5 IJk h ebCe ^ “"Its ln th f rtlc *« 
probability. In other ^ • an d SUch ^ 

the coordinates of £ by oj// ^ ^ ^ 

permutations have equal proh " the »">««!« sl *“* fo ™« fro* 

= S (z), we can say that "iv/n Si “* ‘ W1 

probability, finally s (x) ^ S ^ 

X e S (x) . Hence „e conclude ttL 1 ' Sfe) i£ “* <*ly if 
have the same conditional probability 6 " ~ ' " all - X ins| - x > 




\ 




r 


b: 


We now n «ed to find out the subset q.i \ 
be rejected if the observed x falls in s-(x) test ^ 

criterion. Let us consider the test statistic f = (b-l) q /q 
which was used in Table 5.2 for testing H r Since 2 3 

q ? +q^= Z Z x 2 .-bv x 2 - E v(x. -x ) 2 , 

2 3 i j U *• i I- ' 

has a constant value for all elements in S(x) .• it follows 
ly increasing function of 
2 


,/.,-? 2+q 3 

l °that F is a strict 

t 


Q 0 = E b (X ,-X ) 


1 

b 


r 


a hd hence of 


So 


ij 


E (E X_ L j) 2 - bVX ?. 

j 1 13 2 

E ( 2 X,,) 

test statistic 

T(X) = E (E X.j) 2 
j 


the permutation 


test based 


on 


the 


teSt ^“Jtreat^ 5 ' 


sed 


on 


the 


statistic t. 


the 


i 


there 


are 


\ 


is equivalent to the P® r ™^ ^ ^ undeC RJ. 

Since T (X) is invarian in sets ba biH ty theS e 

1 " OJi ^ 3 


(^l) 


eq 

- , b y T(X> Wi 

assumed oy large 

‘.A ln Practice (b !) b_1 iS o£ apP r 

£f v ^ues to find the * 1 *^ find »» 

^ ^practicable an d we 066 


f T < 2 > 


i 


under H' 





JEar..*« F VAR ' equivalent to the test ^ 

■ a on F is alS ° and v haS central beta dlst ttb tt 

I The teat teS tina H I shall consider here the 

th. *“ Girl-1 9751 • / 1C V (which is equivalent * 

tion under % l on the at ^ main reason for this U 

permutation tes^ ^ , itsel under the penrtt.^ 

basing on , constant Ration distribution 0{ 

rr rSu«- —- v with «* ^ 

^ nderH t b c y fT-tr 9 al beta distribution. 

tw° moinen let us assume here that ther e 


. _ n f a cent-*— 

momen let US assume here that ther e 

SSSS-Ui- In additi ° da t°is? „l = 0, hence all^ = 0. Pt0< 

irr::- ^ tionai assumption ° e=0 ‘ 

2 

- » - n «- * (5.29) 


2 

Q 2 ) = (v- 1 ) °u ' 
q 3 ) = (b- 1 ) (v-D ° v • 


j = (d-i; iv-x/ • 

1 , n + 0 in the set S (x) may be calculated 

constant value of Q 2 U 3 


q 3 = E(Q 2+ Q 3 ) 


(5.30) 


(5.31) 


= b (v-l) a* 

* E v £ "» * ' <5-: 

i k 

!5.29) under HJ. and a 2 = 0 , 

T e 

(q^)- 1 E(Q 2 ) = b- 1 (5. 

r(v) has been calculated by Welch (19 37) and verified by 
n (1937) as ' 

= 2b- 2 (v_i)-l (1 _ b -2 -4 j. 0 4 

u i U.i (5. 

. = (V-l) -* 1 j; q2 

k ilc ‘ Since it: involves complicated 

, he ° nS ^ fr ° m pres enting it here. 

Present assumption the bWv 

error f ro m the obsprv^ • variances can be calculs 

nervations x. .. o,_ 


2b" 2 


(5.32) 


2 


rpmp 




* to 
is 

tiofi 
n of 

f "i r-r. 


. J <x ij - x i.» • 


pr off 


(5.22) - (5.23) 


, _ 2 ( b-T) p 2 

,ar(V) % 3 (v.i) 11 'ir 1 > 

inhere 

e! = E (a 3 - CT^) 2/ b ( b _ 1} 4 

b j_ u • 1 u U * 


? s 2 
1 1 • 


( 2 S.) 

•i 1 
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(5.33 


(5.34) 


(5.35) 


5 2 = £ (x. . - X ) 


Since under the normality assumption V is distributed as central 
beta we shall assume that under and o 2 = 0 the distribution of 
V under the randomization model is also central beta with 
parameters (or degrees of freedom) v 2 , say. The values of 

v^, v 2 ate obtained by equating the mean and the variance of V 
with the mean and variance of the assumed beta distribution. 

Since the mean and the variance of a central beta random 


variable X with parameters 


E(X) = VT<T ' var (*> = •-72, 1 2 ) 

V l +V 2 ( V l +V 2 } (V 1 +V 2 J 

from (5.31), (5.34) and (5.36) we get 

v 1 =<J ) (v-l) / v =<f»(v_l) (b- 1 ) / 


v +v 

1 2 


, Var(X) = 


2V iV 


(5.36) 


(5.37) 


♦ = ( 1 - ^V 1 - 2 b" 1 (v-l )" 1 • the 

b r ui on the basis of the 

'^ le approximate permutation tes ^served value of V for 

fistic V rejects H* whenever ^ ^ ^ {1 . a) percent 

^served sample x is greater than ^ parameters V 2 

of the central beta distributi 


Siven 


in ( 5 . 37 ) . 


buti° n 


basis 


of the 


This approximation 


of beta dlstrX c i 0 sely with 


;p ect 


-us approxiiw^- quite 

t ltst £ \T also flts> ^ 

^ two moments of V ai 




5 

IH 


1 th 

an d the deviation t * 


deereas: 


, eS b P ne i 

£ v unless and the deviation t _ 

, 4 th —»*. °‘ tl y from un ty ± . , is 

■viates signlf > 90 , Vj > 2 ' l-“- v 1 ' v 2 

Since for 1-“ - ‘ ^ valu es of the F-stati stl< . 

function of f 4 V una dj usted degrees of freed. 


unity ana - 

> 2, F i-a/ v i # v 2 


isa 

T- ve Since for l'“ - ' tfte va lues of the F-statis Uc 

\ iBB function of * x usted degrees of freed, 

deC Ti::! significance with « q£ freedom given in (5.37,. 


— .£ ic anc«= -- jrreeuw*" - — "■'•o/i 

which miss si adjusted degree than the unadjusted 

may achieve it Witn more senS1 tiv a 

thus the permutation te , f a random variable s central 

„or»l theory test. Note t >£ freedom then VjX/v^u, 

beta distribution with 2 degrees of freedom and i s a 

has central F-distribution With i 2 

strictly in “ eaSin9 f u UnCt hl an arbitrary distribution with 
rase (ill . ine ik 

''"'itVrespect to the conditional distribution of the X.. given 
the values u £k of 0 *. from (5.37,, we can write under «• 

x ij = V ' + “i + k S iJ k8£lc ' < 5 ‘38) 


where 
P' = M + 


u , a! = a. + u. -u , 
;. i i i. 


e, ik = e ik + u ik - u i.' 

Ea', = 0 , £9' = 0 for all i. 

i 1 k lK 

Hence the observations x ± . are distributed conditionally like 
those m (5.27) with (^^, 6 ^) replaced by (y • , a'. 0* ). Thus 

the conditional distribution of v under the randomization model 

-VLn ““ 3S the distr ibution of v (in case i) where 
ik 0 and 6 ik are replaced by 6 - Tt , „ . , 

conditional case the *an and the varia P “ tlCUlar in thiS 

by (5 ' 31) ( 5 . 34 ) respectively T n °‘ V ^ 

conditional permutational test • US the ap P roximation t0 th 

theory beta test (or equivai ,, S by car rying out the norma 

freedom adjusted as ^ * ntl * the F-test) with degrees of 
case* ls a random variable factor <t>. Actually in thi 

:r;“ «“y «- - 

y o£ type i error est conditionally controls tk 

tnat it® 

12e is approximately 0 


*•« the ““ Wil1 be tru. uncondit. - """ THE '« AN ALysis 2 „ 

discussions and results in th <Jltl °«ally. For 

sche££ e (1967, end Hoteli lng (19 ^‘ feeder is ^ ^ 

5 - 2 - 5 ’ Ef f 1ClenCY - aS ^-aS-S£tl a el ltv - 
* sha11 n ° W ° 0nSider ^ 

randomised block design relative to th “Polity of the 

design in which the treatments ar om Pletely randomized 

experimental units subject to a restrict ne<3 ^ *** ** 

occurs b times in such a wav that ,, 1Cn that eaoh treatment 

probability. We shall also assume^th ” 5 T* ^ 

netween experimental units and treatments in the co^etelT 

randomized design are zero. This will irnn , v .w • 

x in ® i y that in any grouping 

of experimental units into blocks the treatment-block 
interactions Yij and the treatment-unit interactions within a 
block v_ k in (5.26) are all zero, that is, 

°BT ~ °TU 0 ' (5.3?) 

furthermore, for all i,j,k,the technical errors e^^ are 
additive, that is. 


e. = t. ... + u. ... , 

ljk ljk ljk 

where the t^ k are associated with treatments 
associated with experimental units satisfying 

E(ti jk )=°, E(U . jk )=0 

and the t. .. and the u are independent of < 
ljk iJ K 

This permits us to write 

Var(e ijk> = °t.j + °u.ik • 

where 

- a 2 

v ar (t ) = a 2 , Var(u. .J ~ u.ik 

. ljk t * j lj l ,re using here tl 


(5.40) 


and the u. 

1 


(5.41] 


each other and of tl 


(5.42 


N °te that for convenience we are u> 

the random variable and its value. 


The Equation 
^omes 

\j * U + a. + t 


[5.15) f° r 


;Ompl ete1 ^ 


the same notation 

randomized desifln 


■ 'M 

.kii5&T 


t l-jp, 

'&M 
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r 4- = 0 t 

Z a. = o , “ 7j 

-1 V •* ' 

as all v ijk are zero, and 

•a - * 4 '«»*ijK " k s ij^ (t ij^ +Ui J k> 


Lemma 5.4 . 


(i) Cov(e ij ,e i , j ,) = 6 ii' 6 jj ' (a t.i +a u.i ) .' 

2 -1 v _2 
wh e re cr ua = v E a u .ik ' 


(ii) Cov(Ti i j#e il j,) = 0, 


if j * J 1 , 


' 6 ii' d'V 1; °u,i ' if j = J' ' 

J J Ki. v <i ^ J ^ J ’ > 

where o 2 = (w- 1 ) -1 1 e 2 , . 

U -i k ^ 

Proof, (i) Since E(e. ,) = o , 

Cov(e ij' e i'j'> = E < e ij .e i . J .) ' 

= Z 1 E(6. 6, ) pr 

k k' Uk I'j'k' 1 E(e ijk e l.j,k 

= 6 U' S jj. v' 1 E var(e ) = { , -1 2 2 

k !Jk n* j j • v 2(a. .+o z ) 

_ . ~ ' JJ k t.i u.ik 7 

- S “'V ( 0 t.i + < 1 ) . 

(ii) Since E( e ) - n 
ijk 

: °v(n ^ i 

ij e l 1 j■> = I z EU 

k k ' lJk i'j 'k' J e lk E( e . ) 

0 lk 1' j 'k 


J t.i + °u.ik j 


(5.45) 


(5.46) 


(5.47) 


(5.48) 


tW: 

f 




'•SffijS 
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CovOl.jfVj.) = E ^ n ij n i• j |) 

= 1 1 E( 6 iik 6 i'i ■k> ) 6 -v 9 - , 

k k' ijkijk 1 ik i'k' 


(s 


4$) 


(5, 


46) 


= 6 


li 


. , £ £ E (<$. 6. ) B 

L k k' x J k i'j 'k' ; 9 ik 0 ik' ’ 


k k 
Thus for j = j • 


Cov(n *rV j .> ■ «u. I \ k ,e lk e ik . 

= s ii• v ' 1 |4= s ii< H-v" 1 ) o„ a . 


and for j ^ j 1 . 

CovU^,^ ,) 

-1 


(5.47) » « v" (v-ir 1 2 £ (M h .) 0 . v e.. , 

- L - L k k' kk lk lk 


x -12 

= - s ii' v Vi' 


(5.48) since £ 0 = 0. 

k 1K 


Q.E.D, 


From (5.44) 


x. = y + t.+n . + e . 
•J J .J -J 


Therefore, by Lemma 5.4, the variance of the estimate <J> = Z c.x 


of the contrast <f> = £ c j t j wittl £ c^ = 0 is 


J . 


J -J 


Var (<l>) = £ £ c c [Cov(n ,n )+Cov(e .,e . ,)] 

j j I J J • J • J • J • J 

J ", 11 Vj' [Cov( \j'Vj' )+Cov <V' e i.j.>J 


-2 


where 


= b 1 (cr 2 +cr 2 ) £ c 2 + £ c 2 a 2 , 

U u . j j J t. j 


(5.49) 


a u = b 1 f a u.i 


CT u= b "^Vi = b “ V 1 ^Vik. 


(5.50) 


Ml 






^Panaivsis of variance bl ock. °l refers to the 

^ unit error W1 ^-iated with units and a 2 

*» d °U r6fer J *° e technical * rr ° rS /^th the jth treatment. ^ 

components of associate 

refers to the components rand omized designs. For the 

we now consider the randomized design with the 

sate of comparison of the as in the case of the 

randomised block design we^ ^ • vb eXper imental units with t„ 0 
randomised block design, nom ^ b blocks of v units 

indices i.J •• «>ough they ^ ^ for thl6 fictitious 
each but of course no atte ^ ^ treatments to experimental 
blocking in the random assi-gnmen^ ^ game experimental material 
units. We shall also assume ^ £igld expe rimentation the shape 
is used for either design. F idiots) are exactly identical 

- «* — •' TZ «.«..™ 

hni-h r3e<;ians. Under the adaiuivj. j 
measurement x with the treatment J on the plot k in the ith 

block, can be written as 

x uk - k + + B ik + "uk + u ijk (5 - 511 

i = 1, ...»b, j = 1/.../V 
k = 1/ •. • / v t 


fhere 3., = 0., + a. is a unit ef 
ik lk i 

Like 0..) satisfying E E 3.. = 0. 

ik i k lK 

lere 


a unit effect (not within the ith block 

ft — n 


dj? = (bv-l) -1 £ E 8 2 
U i k lk 


(5.52 


is a measure of the magnitude of the unit effect, to be 
distinguished from c in /c jc) ■ u 

Within hint ^2 * 26 h h measure s the unit effects 

within blocks. The cj rinoe j 

units are grouped but"*. 0 2 °“ hOW ex P erimental 

U oes * Now from (5.52) 


(bv-l) = £ Eg 2 

i k lk i^“i +e ik> 

= V(b - 11 "a +b(v- 1)0 2 , 

^ the ° b — Jth treatment 


(5.5: 


TTlo=> v\ 
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: u + t,+b^££d (g . , . . 

J i k ijk ik ijk + U ijk * (5.53) 


the random variables d D i av 

ijk P la y the same role,as the 6 , ,, 

the randomized block design and are define 


1 / if the jth treatment 


l are defined by 
is applied to (i,k) unit. 


ij* 1 ^ 0 , otherwise, 

and from Lemma 5.1 


E ^ijk' d i 1 j 'k j) 


(5.54) 


vb-tTT ' if (i ' k) * (i' ,k') , 

0 / if (i,k) = (i* ,k') . 


(5.55) 


From (5.49) the variance of the unbiased estimator 3> = Z c x 

j j • J 

of the contrast £ c.t.( Z c = 0) is 

j J J j J 


-1 r / — 2 


Var(<f>) = b Her,. + a ) Z cf + Z c.a^ ) .] 

U u j J j J t.j J 


(5.56) 


From (5.49) and (5.56) we conclude that the randomized block 
design is more efficient if 
2 ~2 

a <a /c c-n 

U U (5.57) 

or equivalently > v 1 and is independent of the contrast 
studied. The efficiency of the randomized block design relative 
to the completely randomized design is defined by (from ( 5 . 49 ) 
and (5.56)) 

(° 2 + ° 2 ) * c? + £ c? a? . 


( a * + ° 2 ) lc. + E c'a^ 
u u J j j t.j 

which is dependent on the individual contrast studied unless the 
a t/ j are all equal to o 2 (say). In such cases, using (5.52), 


g 2 , n 2 . 2 ' -2.2 

U IT + a + a a + o 

e u t _ u e 

‘u + a u + a t " a u + a l 

(h _ 2 X . i \ /, 


(5.59 


(b-l) (va^+a 2 ) + b(v-l) ( o 2 + o 2 ) 
~ - , o e u e 


(bv-1) (a 2 +a 2 ) 
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where a 2 = o 2 ♦ ° 2 - From Lemma 5.3 and equation (5.39, and ^ 

2 .n h up aet 


6 


fact that < , - «J ®» a11 J' We gSt 

*- * J* 


Qi 2 2 

e (—M =vcr + cr , 

v b-l B e 


C, 


(5 


E( 


■» = °U + °e 


.60) 



(b-lj (v-1) 

From (5.59)-(5.60) the estimate is given by 


A 

e 


q l " 


b „ 
(b-1) q 3 


jb^ri: 


(5.61) 


^— * / __ 

(b-l) (v-l) “ q 3 


lf}40) concluded from the analysis-of experiments 
carried out at the Rothamsted experimental station and at its 

k~*- - 1 0 27-1934 that the mean error sum of 


Cochran (1938, 
carried out at 
associated centres between 19 


2 

squares (the estimate of a ) for the randomized block design was 
sixty percent of the mean error sum of squares for the completely 
randomized design. Thus assuming that all experiments have equal 
cost factors it would require 10 replicates of a completely 
randomized design to obtain an amount of information equal to that 
from a randomized block design with 6 replicates. 

5.3. LATIN SQUARES DESIGN 

5.3.0. Introduction, In the randomized block design the 

treatments are assigned to hi nrVo a +. j 

y Dlocks at random such that each 

l::r: "T ° CCUr in “ Ch block than assigning them at 

random over the entire set of experimental units as in a 

xpe IlI Li a a nd ° miZed deSi9n - ^ 3 l3tin design the 

Tirir iVided int ° rOWS ana each having 

number of experimental units (eaual .v, w . 

treatments) and the treatments are >u 

in such a way that each treatment 00 ^^ ^ "* "" 

each row and in each S once and only once in 

in each column. Thus s ~ a-v , 

treatments are grouped into r l • ™ latln sc I Uare design the 

and once in columns. Whereat 1CateS in two wa Ys once in rows 

in the ra ndomized block design, only 



% I 




n was 
Letely 
equal 

i 

to that 
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the block variation is eliminated from the within treatment 
variation, in th<e latin square design, row and column variations 
are eliminated from the within treatment variation. As a 
j resu lt in the latin square design the estimate of the error 
variance can be considerably reduced. 

A latin square design is an incomplete 3-way layout in which 
each of the 3 factors - rows, columns and treatments - is at v 
levels and observations on only of the possible treatment 
combinations (each treatment combination contains one level of 
each factor) are taken. 

Latin square designs were originated in agricultural 
experimentation by Fisher (1926). Now-a-days they are used in 
industrial, laboratory, greenhouse, field, medical, marketing and 
sociological experimentations, from comparing^ group of 
fertilizers to testing biological assays, from testing different 
brands of tea to comparing patients in the hospital, etc. 

Despite its popularity it is only good if the number of ’ 
treatments is not large. 

5.3.1. Latin Squares: A latin square of order o „„ „ 

of P symbols in p 2 cells arrant • P is an arrangement 

tw . ■ arranged m p rows and p columns such 

"limn Sy ° CCUrS ° nCe and ° nly °" Ce in — in each 

Example 5.3. 


“ a latin square of order 4 with the latin letters a n o „ 
symbols. r-ers A,B,C,D as- 

A latin square is in thfe standard form if th. 

Jitst r ° w « — coi^TiTThe ' n the 

- ^rr::i:; n 0 ; q r: t ::;. oaiied ««■*— 
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A latin square is called self-conjugate if its arrangement i n 
rows and in columns are the same. The latin square in Example 5.3 
is self-conjugate. 

4 

A set of all latin squares obtained from a single latin 
square by permuting its rows, columns and symbols is called a 
transformation set. 


From a standard latin square of order p we may obtain pi (p_i) j 
different squares by making the pi permutations of columns and 
(p-1) 1 permutation of rows which leave the first row in place. 

Thus the number of different latin squares of order p in a 
transformation set is pi (p-1) 1 times the number of standard latin 
squares in the set. For example, for a latin square of order 2, 
there is only one standard square. The same is true for a latin 
square of order 3. For a latin square of order 4 there are 4 
standard squares and for a latin square of order 5 there are 25 
standard squares. This number increases rapidly as the order 
increases. Fisher and Yates (1934, 1948) obtained the standard 
squares of order 4 and 5 and the transformation set of order 6 
Norton (1939) has tabulated the 562 sets from which it is possible 
to obtain the 16,9 27,968 standard latin squares of order 7. H 

If two latin squares of the same order but with different 
symbols are such that when they are superimposed on each other 

LTth rdered ° f Synfc ° 1S <different) exactly once 

then they are called orthogonal . Y ' 

A pair of orthogonal latin squares, one with l a t< 
and the other with greek synbols fbrms a Syinbols 

Example 5.4 


greco -latin square, 


A 

B 

C 

D 


B 

A 

D 

C 


C 

D 

A 

B 


D 

C 

B 

A 


a 

6 

Y 

6 


Y 

6 

a 

6 


6 

Y 

6 

a 


6 

a 

6 

Y 


squares of order 4 * 

r 4 and they form 


are two orthogonal latin 
greco-latin square. 

rr* *■ • - •> 

hyper greco-latin 



'*Y- •••'-ST*-.- 


-- mutually ort-v, 

order p is at moat p-1. A co „„ lete se ° 9 °^ ^ tin stares of 
Squares of any order except 6 ev^*- orthogonal latin 

this context and the construction of orth 1631111:8 in 

refer to MacNeish (1922), Bose mq ogonal !atin squares we 

Bose, Shrikhande and Parker 'I960) r # M9nn U942) ' Parker (1959 
I960), Raghav Rao (1971) Das and r‘ ° S ^ and shrikh ande (1959a,b 

5 * 3 * 2 ' M alysis of Latin Square Design s (One Observ^n^ 
Per_Cell) : In designing a scientific experiment in a latin squa 
of the required order p, one latin square is selected at random 
from the set of all possible squares of order p. This is achie^ 
by selecting a standard square from the set of all standard 
squares with equal probability and then randomizing all the 'rows 
columns and symbols. For detailed instructions on randomizatii 
we refer to Fisher and Yates (194 3) . 

In developing the analysis of variance technique under the 
assumption of normality, the probability model we consider is a 
conditional one, given the condition that a particular latin 
square has been selected for the experiment. 


* X ijk' 1 = 1 '“" V ' j " 1 . V ' k ‘ 1 . V ' 

denote the observation on the kth treatment in the ith ro 

jth block. The triplets (i,j,k) take on only the v val 
indicated by the particular latin square selected for the 
experiment. Assume that the corresponding random variabl 
| are independently normally distributed with 

U ijk - E ^ X ij> = * + “i + B J + S ' 

Var(X ijk ) “ 2 ' 

l where c. is the ith row effect satisfying l - °, Bj 
Jth column effect satisfying l Bj = 0 and t k is the ktt 

J ■ i - _— 1. i rtnc 


satisfy 
} are ca' 
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ffpcts of columns and <t k > are called the mai n 
called the main effects or co K 

effects of treatments. 

We can also write 

x ijk = E<X ijk 1 + e ijk J , * ’ 6JI 

where the v* random variables e ijk ate independently normally 

distributed with mean 0 and variance 0 . We shall consider the 
following hypothesis 

all a. = 0, H c : all Bj = 0, H T : all = 0. (5.64] 

In the notations of Section 3.1 each hypothesis is a linear 
hypothesis with r = v-1, s = l+3v -3 = 3v -2 and N-s — v -3v+2, 
The least square estimates of the parameters (y, c^/Bj #t fc ) under! 
can be obtained from the following' identity (with S denoting the 
v values of (i,j,k)) 


(i, j ,k) eS 


‘ijk - ■* - “i - B j - V 


= E lx, -x -x .+2x...) x +v E(x. -x -a.) 

(l,j,.k)e S 1JK 1 “ ** x i 1 . 1 

^ V J <X . j .- X . . .'V 2 < X . ,k- X . . .-V 2 - ^ . ,-U) 2 

’rom (5.68) it follows that 

" * * 

1 ~ x i “ X / 3. = X . -x . t, = X -X 
1 1 ** J -J. ...' k x ..k x ...' 


(5.6 


/V A 


ijk = ^ «i + v t k 


,j!k)e S <X ij^ijk )2 = ( J (k)£S (x i jk - x i..- x .j.-x.. k+ 2x...) 
ider H r we still have (5. 


( 5.66 


= x . -x . t =v ft 

* J * -•• k ..k~ x » y= x 


* « * 
jk ~ V + B + C 


and hence 
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(5.67) 


, the best invariant test of l eV el a of H 

Av whenever R R 

<5 - )(V-2) v 1 ( x , -x ) 2 / 2 / 

' 6 *) ^ l . / ijk~ X i..“ X i “ x v +2x > 

Uy - / (i,j,k) £S , * 

^ r. O 


> F „ 2 

- l-a,v-l,v -3v+ 2* 


(5.68) 


(5 Similarly the best invariant test of level a of H rejects H 

•° 4 ) whenever c C 


(v-2) v Z(x -x ) 2 
iv+2. j 


der A 

" > F 


(i,j,k)eS 


( x iiv -x i “ x i “ x t, + 2x ) 

ijk i.. . j. . .k ... 


j the " 1-a,v-1,v -3v+ 2 ' 


(5.69) 


and the best invariant test of level a of rejects H_ whenever 


(v- 2) v I ( x _ k -x__) / z <x ijk- x i..- x .j.- x ..fc + 2='..) 

/ (i, j ,k) eS 

’* r 

> F 2 

- r l-a, v-1, v -3v+2. * 


(5.70) 


The analysis of variance table for the latin square design is 
5.65) given in Table 5.6. 


f'- Table 5.6. ANOVA for latin square design 


Sources 

of 

varia- 


d.f. Sum of squares 


Expected mean 
sum of squares 


Rows v _i q = Zv(x. -x ) 

A -L m • • • 


Columns v -1 ct 0 = Ev(x -x ) 

Z . • J • • • • 


2 ^ 2 ^1 

0 + “ 1 a i v( ^ 2) q“ 
i 4, 

a 2 + —“ Zv 2 v(\>-2) — 
^ jJ % 


6 6 ) Treat- u-1 <J 3 = * v(x . .k' X .. 

>0 rrents k 


C 2 + —Z t? v(v-2)-^ 

v- 1 q 4 


Error v 2 -3vf2 q 4 = 2 (x ijk'’ X i.." X .j." X ** k+2X *-* ) 

^ (i,j,k)eS 


2 
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h is significant at a 

F value corresponding T 

in case the F-vai e the multiple comparison 

le v el Of : 6ivi * e the entire set of 

method discussed in Chap such ^ the tree 


given 


method discussed in Chapter 3 that the treatments ln ' 

treatments into and those in * 

the same subgroup - treatItEnt effects, 

different subgroups have differe 


Example 5.5. In order to study the effects of varying doses 
of superphosphate on the yield of sugarcane crop under 
unirrigated condition an experiment was conducted at the sugarcane 
research station, Rayagude, Orissa, I-ndia. The treatments are 


T x = control (zero lbs of P 2 0 5 as superphosphate) , 
T = 40 lbs of Pper acre as superphosphate/ 
r 3 = 80 lbs of ^2°2 per acre as superphosphate, 

T 4 = 120 lbs of ^2°2 Per acre as superphosphate, 

T,- = 160 lbs of P 2°2 per acre as superphosphate. 
Plot size = 39' * 39' 


Design: Latin square design of order 5. 

The layout and the yield in kilograms within brackets sugarcane 
per plot is given below 
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urces 
U iti° nS 



experimental designs 

T able 5.7. ANOVA table for t he sugarcane data 

sum squares 


tments 


4 

4 

4 

12 


19,544.64 
4, 145.84 
5,989 .84 
17,476.72 


1.28 


24 


47, 157.04 


- 2.48, if a - 0.10, we accept H , the 


Since F l-a,4,12 r - 

»sis of the equality of treatment effects with high 


bypothes 
probability. 


^ 3 ^ 3 , Missing Observations in the Latin Square Design : Allan 
and Wishart (19 30), and Yates (1933) presented a formula for 
estimating one missing observation x^^ in a latin square design 
of order p, given by 


v(x. 


+x . +x 
» « _1 . 1 


) - 2k 


(5.71) 


(v_l) (v— 2) 


As in the case of the randomized block design this estimate has 
ieen obtained by minimizing the error sum of squares, in the 
^analysis of variance table of the Latin square design, with 
J respect to the missing observation. In the original layout we may 
Jf replace the missing observation by its estimate and perform the 
| usual analysis of variance with the single exception that for the 
, datum estimated 1 degree of freedom is subtracted from the 
: actual degrees of freedom v 2 - 3 v+ 2 of the sum of squares for 

errors (q, in Table 5.3.) . As pointed out in connection with the 
1 randomized block design, the F-test obtained by this approximate 
approach will have an upward bias. However, if the F '’ a * 
^-Significant we stop further investigation, but if 
<■ significant we cannot be sure that it is due to treatments 

due to the bias. 

11 t =0 the following 

ft To obtain the exact tefct ot a k arded as a 

1 !* eaure ^ adopted. The experiment is ^ 



" uure ls adopted. me missing 

'* a Y classification in rows and columns with 
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m 


observation. The minimum error sum of squares aj(say) for this 


classification (with ^respect to the missing observation) i s 

2 2 

obtained. Its degrees of freedom is (v-1) -1 = v -2v . The 


quantity (w*»q 4 ) is the exact treatment sum of squares with (v-i) 
degrees of freedom. The exact test of H,p with one missing 


observation is to reject if 


v —3+1 , U) ~ q 4 ) > 2 

v-V U q 4 1 ~ *l-o,v-l,v -3V+1 


Yates (1933) has given an iterative method for estimating 
several missing observations in the latin square design. 

Bartlett (1937) suggested a procedure of inserting a "one" for 
the missing value and zero otherwise and performing a covariance 
analysis with the zeros and the one as the concomitant variable. 


If more than one observation is missing the same procedure 
is followed except that a covariance analysis with more than one 
concomitant variates is performed. Nair (1940) used the 
Bartlett procedure for analysing a latin square design with 
several missing observations. DeLury (1946) summarizes most of 
the available results for handling missing observations in a 
latin square or sets of latin squares. 


5*3.4. Analysis of Covariance : The analysis of covariance for 
the latin square design is a straightforward extension of the 
results on the analysis of covariance of the two-way classifica¬ 
tion in Chapter 4 or of the randomized block design in Section 

5.2. 


The linear model for the observation x ± >k from a latin squar< 
design of order v with a single concomitant variate z is given 

by v iJk 


X iik~ u + a i + ^ + t + y z . ,. + e 

1 j k ijk ijk 

where y is the regression coefficient. 


(5.72) 



Let 
R 


xz 


Z 

i 

Z 

J 


'’<X 1 -X Hz. -Z ) 

•*■••• ••• 1 .. ... 

V (x . -X ) (z . -z ) 
•J.J . ‘ 


(5.73) 
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. . k 


-z 


>, 



xz 


( i, j , k) ES 


(X ijk" X i.." x .j -x 


.k 


+ 2x 


) (Z iik- Z n “ Z • 

->-J X 1 . . . j 


- 2 ..k +2z . 


). 


for C , c etc. 
xx' zz' c 


H n , H_ and H„ 


and let R XX *R be the values of r wh __ 

respectively. w ith simil ar iuter ^“ 

*“• oe co, ariance for testi h ; 

is given in Table 5.8 if = P “R' “c ailu **T 

. y on& of the tabulated F-values are 

significant we can dcterini no .. ' 

u . „ . ^ ermine by the multiple comparison methods 

which of the contrasts in th<= riar-an-,^*. 

ne parameters concerned are responsible 

for the rejection. 


5.3.5. Randomization Mod^i 


(a) Estimation . 


\ 


We have already considered the randomization 
model for the randomized block design. We shall now consider the 
same for the latin square design. For other designs to be 
discussed in this chapter we shall merely state the results on 
permutation in relevant places. The randomization model that we 
shall adopt for the latin square design will be valid regardless 
of how we choose the transformation set containing the latin 
square actually used in the experiment provided that all squares 
in the transformation set had the same probability of being 
selected for the experiment. Let x denote the observation of 

lj K 

the kth treatment in the ith row and the jth column and let 
be the corresponding random variable with 

(5.74) 


E(x ijk> - "ijk 


Then we can write 


x... = 11 ... + e. .. 

ijk ijk ijk 


72 ) 


(5.75) 

technical errors with E(e iJk ) =0. To simplify 

restrictive assumptions about 


where the e.. ^ are 

the presentation we shall »>*e more ^ block design . 

the e. jk than what we have o ^ independ ently distributed 

We shall assume here that ijk 2 the terminology of 

with mean 0 and the same variance a • factors A (rows), 

3-way classification in Section 
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B(columns) and C (treatments) we can write in terms of the grand 
mean, main effects of A,b,C, two factor interactions of A,B,C 
and the three factor interaction of A,B C ^ 


x i1k = M + + «? + cf + a AB + a AC + c . 80 + a ABC + e 

1JK 1 J K 1 j U ik “ij- + “ijk + e ijk' 


where i j _ 1, ;..,v , k = 1, 

£ = £ a® = £ of = 0/ 

i j J k k 


(5.76) 


v and 


2 ct A ? = £ a AB = E a AC ' - 'z a AC - 
i j 1J i ik k * k ' 

Za AB F = Za AB ? = z a ABC _ 

± IJ* j iJk £ -ijk 0 ' 


- BC _ BC 

*°jk = l “ik . = °> 


(5.77) 


It may be noted that in Section 5.3.2 we have taken a A = a.# 
B a C i i 

a j _ p j • a k ^ and all interactions to be zeros. 

Let the randomization as ih the case of the randomized 
block design generate random variable given by 


1, if the kth treatment occurs in the ith row 
= - and in the jth column , (5.78) 

0, otherwise , 

such that the <5. are independent of the e. 

ij k i j k. 

Then, from (5.76) 

x. . = £6 . .. x. , x. , = £6 . ,,x, , 

ij. r ijk ijk l.k ijk ijk 

K J (5.79) 

X ,, = £ 6 . . .X . .. , 

.jk i ijk ijk 

where the x are to be replaced by (5.76) and the x. . is the 
ijk ij • 

observation in the ith row and in the jth column, etc. 

The observed means for the ith row, the jth column and the 
kth treatment are, respectively. 


(5.79) 


x. = - £ E6. .. x. .. , x . = “ 

i.. v . ijk ijk .j. v 

J K 

x . = — £ £6 . .. x. / 

• •k v . j ijk ijk 

and the observed, general mean is 


Z £ 6 ijk x ijk' 


(5.80) 
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= i_ Z 


2 c? ^ijk X ijk* 

v 2 (i,j,k)es 



Since 


Z 6 - Z <5. .. = Z 6. , v = 1# 

i ijk j M * 5 k - l J k 


from (5.76), (5.77) and (5.80) , 


x = M + a* + v " 1 I £ S ijk «>?j B + “ijk + e ijk> ' 
* • i j 


ABC x 

* j. ■ " + a j + V_1 f l s ijk (a ik + “ijk + 6ijk ' 


(5.81) 


i k 


x ..k ” 11 + °k + V ‘ 1 1 * 6 ijk (a ij + a ijk + 2 ijk’ • 


Lemma 5.5 . 


-1 


(i) E(6. sk ) = v-* , 


(ii) - 


r v _1 , if i = i’, j = j’vk = k' , 


0, if exactly two of the 3 equalities 
i = i'/ J = j', k = k' hold. 


a<a iJk 6 i. J .k. ) - 


v (v-1) , if exactly one of the 3 equalities 


i = i', j = j ', k = k' holds. 


£ 1 (v--l) 2 (v-2) , if i ^ i', j £ y t k ^ k 1 


Proof . (i) Since the randomization is equivalent to 

randomizing on rows, columns and symbols, (treatments) so that 
all symbols k have the same probability of appearing in the 
(i,j) cell (ith row and the Jth column) 


E <« iJk ) = P{« ± -1) = v- 1 . 



* ' J j’,k-k',by Lemma 5,2, 

E(4 iJk> = *<«„„-« = V* 1 


ijk ' '"ijk 

For k ^ k> , 
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p{6 ijk 6 ijk' 

P< 4 ijK - X) P{6 ijlC = 11 " 11 

= V- p{<s ijk' = 1 ^ <s ijk 55 1} = 0 * 

aS the conditional probability that the symbol k' falls in the 
(ij)th cell given that the symbol k falls in the (i,j,k)th cell 
is zero. Thus from the symmetry of the latin square design 
with respect to the three factors A,B,C, 

E ^ijk ^i'j'k'^ = ® exactly two of the 3 equalities i = i 1 / 
j = j ', k = k' holds. 

For i = i', j 7* j ', k ^ k', consider the symbols in the ith 
row only when some permutation of the symbols 1 ,.. . ,,v appear. 
Since the randomization generates a random permutation of the 
symbols all vl permutations are equally likely in the ith row. 
Of these permutations (v-1) ! have the symbol k in the jth 
column and of these (v-2) I have k' in the j'th column. Thus 
the conditional probability 

= ttcttt = ^rr • 


‘ k 'ij 'k' ~'"ijk " (v-1) i v-1 ' 

Hence 

E( 6 ■ .v 6 • ■ .) = v -1 (v-l) -1 . 

ljk ij 'k ' 

For i ^ i', j ^ j', k^k' the conditional probability that the 

symbol k will be in the (i',j') cell given that the symbol k in 
the (i,j) cell will be the same for all k' ^ k, because of the 
random permutation of the symbols. Hence 

(v_l) ” 1 (i_p { k is m the (i\j') cell | k is in the (i,j) cell} ) 

= ^-l)- 1 (i_ P {6. = l|6. jk = 1» = (v-l)" x (l-(v-l) _i ) . So 

E(6..,6. . , ) = v~ 1 (v-l) “ 2 (v- 2) . 

ljki'j'k' ; V Q.E.D. 

Using the notations of Table 5.8 and Lemma 5.5, one can, after 

some straightforward but complicated computations, obtain from 

(5.80) the following theorem. 
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‘Theorem 5.6_. 


= o\ + d-2' , ' 1 > °ABC + °AB + V °C ' 

El^y) = 0 2 e + (1-2V* 1 ) O^c + °BC + V °B ' 

S(^j) = o 2 + (1-2V- 1 ) o 2 ^ + <4 + v2 °A ' 


1 2 2 2 2 
.) = a 2 + (l-3v ) a ABC + a AB + o AC + °bc #. 


v -3v+2 
where 

a 2 = (v-1) ~ 1 Z (c£) 2 , 
A i 1 


a 2 = (v-1)- 2 Z Z (o£ B ) 2 , 
l j 


2 / ,, -3 r. r v / ABC, 2 

°ABC= (V - 1) l J * ( “ijk> ' 

2 2 

and similar expression for other quantities like Og* °BC' e ^ c * 

From Theorem 5.6 we may conclude under the hypothesis H (of 

k 2 

equivalently H T in Section (5.3.3)) all a c = 0 or ct g = 0 , that is 


°4 Q 3 

E ( r z —-—•) >> E (—4>) (5.8 2) 

V -3v+2 X 

2 2 2 
if a* < via + a ) 
ir ABC v AC BC ' * 

From (5.77), (5.81), and Lemma (5.5) 

E(X §k ) = H 

• i 

no matter what interactions are present. Hence 

/v 

^ £ C k X ..k is an UI *iased estimate of the contrast 

*= \ with i .<£ - o v . . 

i..tr Th ir 1CUlati0n ° f Var(;> ' Uslng (5 - 81 > and Lemma (5.5). 
19 orward but complicated and nothing is known about 


(5.82) 
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lts estimate. However, Worn Theorem ( 5 . 6 ) and (5.81) if c* = 0, 
it be argued for the estimates * of j, + a£ that the errors 
- y - , k = 1 , ..., v, 

are syronietrically distributed, and in particular, they have equal 
variances and covarjjances. under these assumptions we can prove 
the following theorem. 

2 

Theorem 5,7. If = 0, then 

Var(J) = + a^ B ) E . (5.8 3) 


Proof . Write, for k = 1,...,v 


0R = ll + a k ' 

6 k = X ..k = 6 k + f k ' 

where 

( j 

f k = V- 1 Z Z « ijk (a^ B + e iJk ) . 

Obviously 

E ( e k ) = e k , E(f k ) = o . 

Since Z CL = 0, J = Z c k e k is 

k K k 

Let 


(5.84) 


unbiased estimator of \jj, 


? (0, — 0 ) A — 1 V n 

q = — -IS—I- , where 6 = v k e k 

v- i 

As a with 6 = v“ ^ I 6^ » 

k 

K -®. = 9 k - 9 . + (f k' f . ) ' 


(0 -0 ) 

e(q) = i —!s— 

k v-1 

■ (V 6 > 2 

k V“1 


+ -L- E(Z(f k -f ) ) 

v-1 v K * 


( I E )} 

k 


1 

+ v-1 
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= * ( 9 *v : -r~ + A < i* ( ^)-v 1 * ( i f k. • 


Unde.: the assumption °^ BC - 0/ for a11 k ^ k 


E(f 2 ) = Var(f k ) = a 1 


? (say) 


E(f k f k ,) = Cov(f k< f^,) = (say). 


From (5.85) and (5.86) 

(9,,-e ) 2 


' V 

E (Q) = Z —~rf - + 

k 


—- (va 2 - v 1 (va 2 + v(v-l) p o 2 ) 
v- 1 1 l •*- 


(6 -6 ) 2 ' 

•- + ofd-P) • 


Var (J) = E ( I c k ( ®k~V } 
X 


■ E! l c k £ k> 


= E a 2 I C 2 t puj £ £ C C, , 

1 k k 1 k k. k k ' 


k k 
k ^ k 


= o 2 (1-P 2 ) Z C 2 
1 k K 


From (5.84) and (5.87) we get 


°l ( l- p ) - v_1 (°ab + °e> 


From Theorem 5.6 it is obvious that 


-1 q 4 o 

v -r 2 - 1 

v -3 v+ 2 k k 


(5.86) 


is an overestimate of V*r<« unless o 2 =0 and o 2 - 0 . 

AC BC 


With the ««. (5.76) there does not 
test of H 0 : all-' of = 0 or equivalently 


<3 

1 
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°C ~ 0 A p ® rmutatlon test Of the hypothesis H' that the factor 
c (treatments) has absolutely no effect in the sense that 

2 2 „2 2 

a C ~ °AC a BC ~ a ABC ~ 0 

and that the v technical errors e occurring in the experiment 
are distributed like v variables e.. whose joint distribution 
does not depend on how the levels of C are combined with the 
levels of A and B. The joint distribution of-the e i , is subject 
to the conditions Me^) = 0 but is otherwise arbitrary. Under 
H' , from (5.76), 


(5,8?) 


x iik = y + a i + a ? + aAB + e - ■ * 

ijk 1 J ij ij 


( 5 . 88 ) 


Since the particular latin square used in the experiment is 
selected at random from only one transformation set, the group 
G (as in the case of the randomized block design) of permutations 
is the set of all (v!) ^ permutations of the v rows, the v columns 
and.the v treatments (symbols) . If the transformation set was 
first chosen with certain probabilities from the totality of all 
transformation sets the permutation distribution of any test 
statistic or its moments are determined by those for the 
transformation set by weighting with the same probabilities. The 
permutation test of based on the statistic (see Table 5.4 for 
notations) 




F = (V_2) -Z 

is equivalent to the test based on the statistic. 

Q 3 

Q 3 +Q 4 ' 

which is a strictly increasing function of F. Since 

Q +Q = I X 2 . - v 2 X 2 " Q 2 " °3 ' 

4 (i,j,k)£S 1J 

the denominator of U is constant for all permutations in C-. 

Furthermore, since 


(5.89) 


(5.90) 


Q - -1 

- V 


r l i z x iJk ) 

k i j J 


( £ c x ijk> 



42 ANALYSIS of VARIANCE ^ all permutations in G, it 

X. is constan t of h ' on the 

nd /• i K) £ S ljK the permutation test o c 

(i»j» K;e f c+ . to base tne pc 
would be simplest to o 


statistic 

l T* , Tr = E R X iJ k ' i 

* 1 f dlf£e rent squares in a transformation 

HOW, Since the number of A standard squa res in the set and 

in VI (V-1M times the numb ^ v , permutations of v 

the statistic U is invarl n olude that the nuirber of 

symbols (levels of . ^ y tak£S on wit h equal probability 

different values t e times the number of 

for a aiven transformation set is (v-D 1 times t 
for a aiven systematic way of calculating 

standard squares in the set. ay 

o r=t the v 2 observations in a square 
these values is to arrange first the v 

with the rows and the columns corresponding to the levels o , B 
respectively and then applying for each of the standard squares 
of the transformation set the (v-1) i squares which are 
obtainable by permuting the '(v— 1) rows of the standard squares 
after the first. One could also very well calculate the value of 
the statistic U first only for the latin squares that look as 
though they might give the statistic U a value greater than or 
equal to that observed. In other words one calculates the 
probability that U is greater than the observed value. 

From Theorem 5.6, it is easy to see that' E (U) is the same in 
the permutation distribution as in the distribution of U under 
the assumption of normality of the X. . Welch ( 1937) has 

practical use. Thus the corrections needed in the degrees of 

L^lT stV n the ° f thF randomize a block design) of the 

^ual test based on the assumption of normality of the X to 
make it approximate the exact nUmi* +-• ljk 

feasible to calculate How 91011 tSSt better is hence not 

numerical examples showed that tK ° n ® baSiS ° f *° W 

Is not too bad an annro • 6 n ° rmal theor Y test based on U 

5 . 3.6 E ff . “-nation to the permutation test if v > 4 

Square Design, 

Classi fi<=a tio n of ^ ... 1 18 desirab le to find out if th- 


rr = E 

* i 


for a given transrormatiw 
standard squares in the s< 


it is 


of the 


desirable to find out if the 


experimental units by 


rows or by columns or 




it 
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0,1 s ‘ 




1 


w both has led to increased precision in the . . 

.hall find out here the efficiency of the latin Periment- 

relative to the randomized block design and to ^ 

r , , ign and to the completely 

randomized desrgn. The completion in this case goes in the saae 

„ay as in the case of the randomized block design. So .we shall 

jterely state the results. 


to 
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A, B 
‘es 


The estimated efficiency of the latin square design relative 
the randomized block design made by rows as blocks, is given by 


v-1 


, „ H 4 

(V 1} tv-1) (v-2) (v— 2) 


q 2 + (v- 1 ) q 4 
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The estimated efficiency of th< 
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£ Of 


^.ue latin square design relative to 
the randomized block design made up by columns as blocks, is 
given by 

(v-2)q x + (v-l)q 4 

vq^ 


2 inf? 
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The efficiency of the latin square design relative to the 
completely randomized design is given by, 

(qi+q,) 

- T - 4 
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(v- 1) 
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_ are given in Table 5.3. 

The quantities q^^ ^ ^ e£ficiency of the 

Cochran (1938, 1940) repor ^ coroplete iy randomized design 

latin square design relative rando mized block design is 137 

is 222 percent and relative ° afc Rot hairstead and its 

Percent for the experiments C ° n ^ 34 . A sim ilar result was 
associated centres during 19 27 '| } who found that the 

Stained by Ma and Harrington^ 79 percent as efficient as 
completely randomized desiq 

l atin 




square design. 
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0F variance 


,r-v^d that if a - =■*: 

it may be remark . design, the best possibl, 


randomized block des; 


However, i* — in sq uare 

tes to be used instead of 1 view of th e experimental 

shape Of a block from the por ^ other than that given by 

procedure might have to be ^ many comparisons that are made 

rows or columns. This may 

in this way. . A D-, E- and 

_ . <. hat the latin square design 

The fact that the _ t optimum 


A-, D-# E- and 


11 follow from the A-, D-, *- and L-optimum 
of the Utalited Youden sguare design which we shall 


L-optimum will 
properties of the gene 
discuss in Section 5.4 


5.4. BALANCED INCOMPLETE BLOCK DESIGNS 

If the number of treatments v in an experiment is large, it may 
nor be possible for various reasons to use large size blocks to 
accommodate all treatments in each block. 

We may use incomplete block designs, that is, designs in 
which the number of experimental units in each block is less than j 
the total number of treatments. 

Let us consider an experiment where v treatments are to be 
compared in b blocks of sizes ...,k b . Let the ith treatment b 
replicated in r. blocks, i = 1.v and let the ith and the jth fi 

treatment occur together in A. . blocks, i -j = i . { 

i J ' ' J i*«..,v,i^j. j 

the tTt b V N = ‘V' "ij iE the of times 

ltr X 0£ th ° C r “ the ^ bl ° Ck 13 incident 

1 » ° «« «*• -complete block deli^n ^ 

y s called a binary design. 

The matrix 


c ~ D ( ri -,r v ) - n 

Where D ^!.6 ) 


'D(k 


1 ' * • * * k h ) N , 


,,e k' is 


elements 0. 

design. 


^^l ion 5 .t_ A 


is a k 


k dia 9onal matrix 


(5.91) 


Called the C-matrix 


w ith diagonal 


°f the incomplete block 


treatment 


contrast E 


effect 


j t j -S't, with 


s t=( t J J 3 ~~ 

- 1 t l .tj • i 


called 
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elementary 
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Itlcly 

; S to 


;s than 


act if £ has onl y two non-zero components 1 and - 1 . 

< -< m t u. the proof of various results presented below in 

H . | section with the properties of incomplete block design we refer 

C6 y 'V B aghsva Rao (1971). 

)r 

^ Elementary contrasts in the treatment effects involve all 

l ^j£fe rences i ^ j an d it is often desirable to design 

eK periments where all elementary contrasts are estimable. 

; | f jjiition 5.2. Connected design . A design where all elementary 

contrasts are estimable is called a connected design. Otherwise 
’ ^ is called a disconnected design. 

An incomplete block design with v treatments is connected if 
an d only if the rank of its C-matrix is v-1. 

may 

Chakrabarti (1963) has shown that in a connected design all 

s to 

the diagonal elements of its C-matrix are positive and the 
f principal minors of all orders of its C-matrix are positive. 

! npf inition 5.3 . Balanced design . A connected design is said to 
than be balanced if all elementary contrasts in the treatment effects 
can be estimated with the same precision (inverse of the variance 
> be j 0 f the estimator) . 

lent b£%+ Thls aefinit ion does not hold for the disconnected design, as 
e jtb tiUl elementary contrasts are not estimable in this design. To 
J- f f overcome this difficulty, Vartak (1963) defined a design (not 
times necessarily connected) to be balanced if every estimable 
3e nt normalized contrast in the treatment effects can be estimated with 

LueS the same precision. Rao (19 58) obtained the following two 

3S ign. theorems. 

T heorem 5.8 . A connected design is balanced if and only if all 
characteristic roots of its C-matrix are equal. 

: 9l) 

Let the nonzero characteristic roots of C be all equal to 9 • 
Then each of the elementary contrasts, in the treatment effects is 
10X00 es timated with variance 2 6' l o 2 where o 2 is the error variance 

\ the design is balanced. For the balanced design the C-matnx 

) ’ / ls of the form 


times 

dent 

lues 

esi9 n ‘ 
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C = e (I v - v-1]E VV* 


I ANCE 


whose 


is the v x v mat] 


C = vv fHx an d E is the v x v m-,.. 

„ ac v x V identity matrix ana n vv mat 

where I„ « the „ j. to unity, 

whose elements a r 

. 0 For any balanced design with ^ \ = r 

Theorem ’L l- (say) , the incident matrix N satisfy 

(say ), k 1 - •• • 


N'N = (r-v) l v + E vv (5 - 9 3) 

where A(v-l) = r(k-l). 

Note. An incomplete block design with “ *** “ = k is 

called a proper design. 

Definition 5.4 . Balanced incomplete block designs (BIB) . a 
balanced incomplete block design is an arrangement of v treatments 
(symbols) in b blocks each containing k experimental units (k < v) 
such that 

(i) every treatment occurs at most once in each block, 

Ui) every treatment occurs exactly in r blocks, 

(ill) every pair of treatments occur together in exactly \ blocks. 

The quantities v,b,r,k,A are called the parameters of the BIB 
design. 


with k ^ = 


• = k b = k is 


Prom Definition 5.4 it follows that 


Vr = bk 


(5.94) 


The Incident matrix N of the rir • 

the characteristic roots of its C !at SatlSfieS Theore “ 5.9 and 
Theorem 5.8 it ls a balanced “ re ™us from 


Example 5.6 


Block 


Treatments 

2, 4, 7 
2 ' 3, 5, 1 
3 - 4 . 6, 2 


Block 


1 « of a bib de 


ign With 


Parameter 


Treatments 
5 p 6, 1 , 4 
6 * 7, 2, 5 
7/ lp 3, 6 

S \J I — W _ n 
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'H 


Although Kirkman (1847) found a particular 


series of BIB 


NJ ^ ign£ - ^ tS development in statistical literature was 

ori ginated by Yates (1936) . S1 n ce then significant contributions 
* t *, were made, among others, by P isher ( 1940 , 1942 ), Fisher and Yates 

,-f 963) and BOSe dUrin = the y^rs 1938 to 1959. Though the 
'following lemma follows from Theorem 5.9, it is worth giving the 




(s 


,9^ 5 -following elementary proof of it here. 




iS- 1 "" 13 5 ’ 10 ' For 3 BIB design with parameters v,b,r,H, 
X(v-l) = r(k-l) . 


• A 

r ®atm eott 

s (k <»] 



(5.95) 

Proof. Consider all possible pairs of treatments each 
containing a particular treatment 0. Since 0 occurs in r blocks 
in the design and in each of these r blocks there are k-1 other 
treatments present, the total number of possible pairs of 
treatments in the design, each containing treatment 0 is r (k-1). 

On the other hand from Definition 5.4 of the BIB design, 0 
is paired with each of the remaining v—1 treatments exactly A 
times. So the total number of possible pairs of treatments each 
containing 0 is A(v-l). Hence 

A(v-l) = r(k-l) 


Q.E.D. 

From Theorem 5.9 the incident matrix N of the BIB design 
isfies N'N = (r-A)I v + E vv - Thus 

U' N | = (r + (v_l) A) (r-v) V_ 1 . (5.96) 

Since k < v from Lemma 5.10 we get A < r. Thus from (5.96), N'N is a 
nonsingular matrix. Now 

v = rank (N'N) = rank (N) ^ b • (5.97) 

This is due to Fisher (1940). 


Definition 5.5 . A BIB design which can be arranged in sets of 
blocks such that each set contains a complete replicate of all the 
treatments is called resolvable. 


In the case of resolvable BIB designs Bose (194 2) showed that 
the inequality (5.97) can be improved to 
v £ b-r + 1 


(5.98) 
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m 


„ £or a resolvable BIB design the eguality in (5.98, hol ds , ^ 
Celled affine resolvable. For the existence and the 
construction of BIB designs for different values of the para^^ 
we refer to Raghava Rao (19 71) . 

Generalizing the notion of BIB designs Kiefer (1958) defi^ 
design to be a balanced block design (BBD) if it is an arrange^ 
of V treatments in b blocks each containing k experimental units 
(not necessarily k v) such that 

(i) the number of times n^ . that the treatment j appears in the 
ith block is k/v if this is an integer and is one of the two 
closest integers otherwise, 

(ii) the number of replications of the treatment j is the same 

for all j, * ' 

f 1 •* 

(iii) for each pair* of treatments (j,j'), j ^ j' , 


n.. n.. 

ij ij ' 


is the same. 


Certain desigps of this type were constructed by Tocher 
(1952) . A special case of BBD. where each of the n. . takes one 
three preassigned values, was considered by Rao and^Das (1969) 
which they called balanced ternary design (BTD) . 

- Intra -klock A nalvRjg of rtr 

here the intra blnov -- ~ gns : We shall consider 

v.b, r, k, A under the normll theory 7L7T 

slightly more complicated than that of the tf m ° de1 ' “ 
designs treated so far Let ° ther complete block 

jth treatment in the ith d&note the observation on the 

occurs in the ith block) where f r °!' ia ° d that the Jth treatment 

“iJ ~ 1 ’. Let us assume that the cor rUnS ln ^ Set S f ° r WhiCh 
ij independently normally distribut 0 ^ 1 ^ rand ° m variables 


'^ X ij) ~ M + a. + t 
’--'C i 2 1 j 

ij' = a 


u + a. + t -l 

i + e.. 

J iJ 


(5.99) 


(5.100) 
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where (i/J> ' the e ij ar ® independently normally distributed 

w j.th mean 0 and variance a 4 


f 





i 


V is the grand mean, the a. is the 

it* *> lock effeCt satisf yi"9. s a ± = o and the t. is theVh 

treatment effect satisfying y t - 0. We ar e J interested in 

3 3 

testing the hypothesis : all t^ = o . The hypothesis H a : all 
= 0, which has no practical significance can be treated 
similatiy* The problem of testing is a simple case of a two-way 
classification with unequal number of observations per cell and 
without any interactions (as treated in Chapter 3) . 


First we shall consider the analysis of a general 
incomplete block design (not balanced) with the added restriction 


that 


r l = 


= r 


v 


= r 


and k. = . .. •= k, = k. Then we shall 

j. 1 b 

specialize for the case of a BIB design. In the notation of 
Section 3.1, the hypothesis H T is a linear hypothesis with 
r = v— 1, s = v+b-1, N-s = bk - v - b+1 = vr - v - b+1. 
squares estimates of y 
minimizing 


The least 


V 


t . under Q, can be obtained by 
J 


2 

s = 


(i, j) e s 


(x 


ij 


y - a ± - t > 


(5.10 1) 


with respect to y , 

2 

derivatives of s 


a. 

i 


and t . . Equating to zero the partial 


with respect to y, 


a , and t., 
i 3 


we get for 


= 1,...,b; j = 1/• 


v 


bk y + k E a. + r 


t. = 
3 


(i,j)e s 


x. ., 
U 


(5.102) 


k(y + a .) + 
i 


n 


r(y + t.) 


Z 

j 

Z n 
i 


iJ 


t, = B , 
J 1 


ij 


A 

a . 


!= t j 


where 


B, = ith blbck total = 2 

1 j 


x. 


ir 


Tj = jth treatment total 

Z 

j 


Z x. . , 

i 1J 


r = l n ij' k = ! n b 


Let 


Ok 
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r n.. n.. 
f U U 


= X 


JJ 


(5 . 1 & 3 ) 


Q. = T. - k -1 Z n. . B. = T. - k _1 T' , 
J J i ij l J J 


where? Tis the sum of all block totals in which the jth treatment 
occurs. 


For the BIB design 


X, if j ^ j ' 


X . ., = 

JJ 


(5.104) 


r, if j = j' 


The quantity is called the jth adjusted treatment total, the 


adjustment consists in subtracting from the jth treatment total T. 
the sum of block averages k for all blocks in which the jth 

treatment occurs. From (5.102) and (5.103) the estimates of t., which 
for reasons that will be obvious later are called intra-block" 1 


estimates of the t_. are given by (with 6 . . , = 1 if j = j • and 0 
otherwise) . 

j, <r 6 jj ' " k A jj ' = Q j • (5.105) 


The error sum of squares for testing H T or H a is the minimum value 


of s in (5.101)and is given by 


q 3 = l X iJ - ] V.j - k_1 J 


= [Z x 2 . 
S 1J 


- C.F.] - Z t Q - Ck 1 Z B 2 - C.F.] 
j J J i 1 


- (Total sum of squares] - [treatment sum of squares 

eliminating blocks] - [block sum of squares ignoring 
treatments!. 


where C.F. = b 1 k" 1 ( Z x ) 2 

S iJ * 


(5.106) 


Under Hj,: all t. - 0 the minimum value of s 2 is given by 

r Z _ 1 n ■» 


l X?. _ k“* I R* 

S . V 


■■” r “ i ■ tB, “ k ■- - «~*» 


(5.107) 







ffcus the best invariant test of i eve i a of 

hk-b-v±l E > F 

V - 1 j q 3 ~ 1 -a, v-l,bk-v-b+l ' 

We now specialize for the case of the 
(5.104). From (5.105) 


ER 'MENTAL DESIGNS AND THEIR ANALYSIS 251 


h t rejects H 


whenever 


(5.108) 




10 5) 


t, - k -1 A It 

k 1 j 


BIB design where X , satisfies 


J ' =Q J 


>ich i 


(5.109) 


(5.111) 


Because of the side condition 
Z t . = 0 

j J (5.110) 

we get from (5.109) 

= (rE) 1 Q j (5.111) 

where 

rk-r+A (k-1) v . . , 

£_ rk k (v_l) 1 * (5.112) 

If there are more blocks that contain the treatment j than do 
not# that is if r > b-r, it is easier to compute ■?^, where = sum 
of all block totals not containing the treatment j and to use the 
differences of the 
T.+k -1 ?. 

n — _J_2. (5.113) 


re 

for estimating elementary contrasts in treatment effects. This is 

because T. + T. = J x . From (5.111) 

- J , ( -l r o 2 (5.114) 

l t.Q. = rE) ^ 2. . 

j j j j 

. . variance table for the BIB design 

Thus the intra-block analysis of variance c 

is given in Table 5.9. 

' . . f icant and consequently H T Is rejected it is 

If the F-value is s g multiple comparisons, to divide the 

then necessary, by metho s that the treatments of the 

entire set of treatments an<3 those from different 

same group have the same ffec ts For this purpose we need 

groups have different treatments^ the ‘ comoB variance pa 2 ^ P i» 
to find the common variance a t ^ ^ thg intra -block estimators 

the common correlation coefficie following simple device. 


T . +k 1, T . 


(5.114) 


of t . . This ca 


::: ** £ ° nowin9 simpie aevioe ' 
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Table 

^ q. Intra-block ANCVA for 

the BIB design 


Sources of 
variation 

d. f. 

sum of 

squares 

Expected mean 
squares 

F 

Blocks 

ignoring 

treatments 

b-1 

*1= k' 1 2 

B 2 - C.F. 


< 

Treatments 

eliminating 

blocks 

V»1 



2 . _2 bk-b-v+1 

a +rea T (v_l) 

q 2 

q 3 

Error 
(intra¬ 
block) 

bk-b-v+1 

q 3 


o 2 


Total 

bk-1 

,, 2 

Z x. .- 

S 1J 

C.F. 



2 

where a T = (v 

-l) -1 I 
j 

S 2 - 





q 

t 


Note . Because of the symmetrical way in which the treatments 
appear in the design the t^ have equal variances and covariances. 


Since Z t =0, we get 
j J 

va £ + v(v-l) p = 0 , 


(5 


. 115 ^ 


and hence p = -(v-1 ) 
that 

A A 


-1 


From the Section 5.4.3 it will follow 


Var(t 1 -t 2 ) = (re)" 1 2a 2 . 

Thus from (5.115) and (5.116) we get 
(rE)' 1 o 2 = 2o 2 - - 2p 0 2 ' = 2v( v-1) 


(5.116) 


and 


- 1 a 2 ^ 

° t 


consequently 


Var (t , •) = 0 (v-D 
J re 


c °v(tt) = _ -SL 

J J 1 re * 


( 5 . 11 ’) 


given a~forlui a °to <! ‘ r bt l:i0nS ^ BIB Des1r ) n - Corn ish (1940) has 
Sc. orrcma to obtain the i 




intra-block estimate of a missing 
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value in the BIB de sign. The intra-bloek estimate of the 
missing value x ±j is 

A vr (k-1) B^-k (v-l) q^_ (v-1) q<^ 

x^j = kTk-1) "(bk-b-v+l)- L (5.118) 

where Q'j is the SUITl of the Q values for all other treatment (but 
not the jth one) which are present in the ith block. If more 
than one observation is missing the same procedure as discussed 
in the case of the randomized block design and the latin square 
design may be adopted. As usual one degree of freedom per missing 
datum is deducted from the error sum of squares of the intra¬ 
block analysis of variance. Also one may introduce concomitant 
variables composed of zeros and ones for the missing values and 
perform the analysis of covariance (Rao, 1947). If several 
observations are missing it may be advisable to analyse the design 
as a completely randomized design. 

5.4.3. The Efficiency and the Optimality of the BIB Design: 

From Theorem 5.7 we can conclude that for any estimable contrast 

A A 

ill = I c.t.. the variance of its unbiased estimator ilj = E c ,t , 

j J J j 3 3 

is given by * 


Var(^) = V I c \ 

j J 

where 

V = E[(v_l) -1 Z (t.-t ) 2 ] - (v-l )" 1 Z (t - t ) 2 

j 3 * j J 

= EHre )" 1 (v-l)" 1 *^] - a 2 T 
* (re)" 1 a 2 , 

where Q' is the random variable corresponding to g 2 in Table. 

5.9. Thus 

Va r(M = (te)" 1 o 2 Z c. 2 (5.119) 

j 3 

If we have conducted the experiment in the randomized block 

de sign with the same number of treatments v, the same number of 

2 

re Plications r for each treatment and the same a , then the 
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unbiased estimator of ip is 

/ 

? = £ °j x .j 

j 

with 

2 

Var (40 = —— Z c -f 
j 

So the efficiency of the BIB design 
block design is 



{ 

t 

(5.120) 


relative to the randomized 


Var (40 _ e (5.121) 

Var (40 

Thus the quantity e , as given in (5.112) is called the 
efficiency factor of the BIB design. 

To facilitate the derivation of optimality results we need 
to develop the analysis of a general incomplete block design with 
r^ = r and all k^ = k (which is not necessarily a^binary design). 
Let 


G = £ x. ., 

(i, j) eS 


T 



• / 



B 



• • / 



Q~ (Q^#.../Q v )'# t, — (t^,...,t^)' , ql = ( a 
I (n) = n x n identity matrix, 

0 n = n-vector whose components are all equal to 



unity. 


From (5.102) we get 



■multiplying both sides of (5.122) by 

V 

' b f i 




( 5 . 122 ) 


(5.123) 
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n'V 1 


I (V) 

-r -1 N 


-k _1 N 

1 (b) 


obtain 


Q = (rl (v) - k -J -N'N)t 


(5.124) 


= C t 


where 


C = rl (v) - k 1 N'N , 

N i is the incident matrix 
Using (5.123) we get 

ce = 6' c = o 

V V 


(5.125) 


So we confine our discussion to the case where the matrix C has 
rank v—1. Thus this design is connected and all contrasts 
tj-tji(j^j') are estimable. It is now left as an exercise to 
show that ( Note ; here Q is a random vector) 

Var(Q) = Co 2 . 

Since for any general incomplete block design 

1 n. . = bk, 

■< iJ 


tr C = bk - k‘ 


is maximized by the balanced block 


design whenever it exists. The C matrix of such a design is of 
the form hypothesized in (5.8 8) . Thus the balanced block design 
is A-, D-, E- and L-optimum. In particular the randomized block 
design and the BIB design are A-, D-, E- and L-optimum. This 
proof is due to Kiefer (1958) . 

For binary designs Mote (1958) proved the E-optimality of 

des igns and Kshirsagar (1958) proved the A- and the 

^-optimality of BIB designs for estimating orthogonal contrasts 

dn <3 also of the Youden square design. Roy (1958) established the 

ec essary and the sufficient condition for a binary design to be 
most cff: 

^■•-.cicient with respect to the A-optimality criterion for 
timating all elementary contrasts and proved that in the case 
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of proper binary incomplete block designs a most efficient design 
if it exists, is necessarily a balanced incomplete block desi gn 

5 . 4 . 4 . Recovery of Inter-block Information : Since some 
incomplete block designs have low efficiency factors e , 

Yates (1939, 1940) suggested the use of informations available 
from interblock comparisons to increase the precision of estimates 
of treatment effects and showed how this can be done for the cubic 
lattice design (19 39) (to be discussed later in Chapter 6 ) and BIB 
design (1940). Nair ( 1944) gave the method for a partially 
balanced incomplete block (PBIB) design (to be discussed later in 
this chapter). Giri (19 57) gave the method for the row balanced 
PBIB design. Rao (1947,1956) adopted this method for any 
incomplete block Resign. Later Roy and Shah (1962) and 
Shah (1964) examined the problem of recovery of interblock 
information in detail in the case of a general incomplete block 
design. 

The process of recovery of interblock information consists 
of applying the method of weighted least squares to intrablock 
and interblock contrasts of observations for the purpose of 
estimating the treatment effects, the weights being inversely 
proportional to the variances of these two contrasts. The ratio 
P of the interblock variance to the intrablock variance plays an 
important role and since this is usually unknown the ratio of 
estimates of these variances Obtained from an analysis of variance 
of the observed data is substituted (Yates (1939,1940)) . If P 
were known, the combined estimator would have all the good 
properties of the usual least squares estimators. Since only an 
estimate of p is used, obviously the properties of this 
combined estimator are dependent on the type of estimator of P 
used. Graybill and Weeks (1959) showed that for a Bid design the 
combined estimator of a treatment contrast based on Yates' 
estimator of p in its untruncated form is unbiased. Graybill and 
Seshadri (1960) proved the same result with estimator of p in it® 
usual truncated form for BIB design, shah (1969) has obtained an 
estimator of p which produces a combined estimator which is 
unr ormly better than the intrablock estimator for any treatment 
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t rast for a 3IB design with more th™ c 
giinpT e lattice design with 16 ^ -« • . 

Lattice design with 9 treatments or more. P 

j* Now we shall treat here in detail 
W . ■ , aetaii the case of a BIB design. 

fiTe-i. can be easily extended to Youden souaro ^ • 

V xt ^ , en s< ? Uar e designs, the PBIB 

9 i& igDS and rOW lanced EB1B designs, it may be remarked 

tnat if we stlck to the fixed-effect model of the BIB design as 

given in (5.100), then intrablock estimators are optimum by the 

gauss-Markof f theorem (chapter 3) . The problem of recovery of 

interblock information to obtain more efficient estimators of 

treatment contrasts arises if we consider the block effects a. 

in ( 5 . 100 ) to be random variables a^ instead of unknown 

constants satisfying z tti = 0. 

The new mathematical model which we shall now adopt for the 
recovery of interblock informations is not usually realistic for 
field experimentations where the a^ are random variables only 
because they are random permutations of the fixed block effects 
of the b blocks actually used in the experiment, in this case a 
randomization model already discussed in 5.2 and 5.3 would fit 
better. Our assumption for the recovery of interblock 

r 

^information is as follows. 


i /• c (5.126) 

|x ' = y + a A + tj + e^y (i,j)eS 

f where the e. . are independently, normally distributed with mean 0 

) and variance'a 2 and are independent of the a The a are 

n ^ci-ributed with mean 0 and the variance 
independently normally dist y = 0, because the 

: and Z t = 0. We cannot assume t a . _ i 

j j 

a. are independent. 

We shall first observe that under the model 

We shall first teBt statistic (- v _i—>Q 3 

intrablock estimators t. and 

.riables corresponding to q 2 end q 3 

“here Q and Q 3 are random v h _ ve the same distributions 

‘ Tespectively (as given in Table ‘ tri bution of the t^, Q 2 and 
as under (5.100). Since the joint dis tribution under (5.126 

r- °3 «n be regarded as the conditional 
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given the a., the constant ji and the a. in (5.100) corresponding 
to the fixed values of the a^ according to ou — a^ - a ./b # y = ^ 
a./b. Since this conditional distribution does not depend on the 
values of y and the a^ and hence not on the a^, it must be same 
the unconditional distribution of the tj * Q 2 and Q^* 

Let tj denote the least squares estimates of the t^ from th e 
block totals 

B. = 2 n (y* + a + t + e. .) 


(5.127 


= ky' + 2 n. . t. + f . , 
j ij J 1 

where 


f ± - + j n ij • 


i — 1# • • • § b 


(5.128 


The f ± are independently normally distributed with mean 0 and the 
common variance a 2 = k 2 a| + k a 2 . Equation (5.127) represents a 
fixed effect linear model in terms of the random variables B., 

i “ 1, ...,b, (we ace using here the same notations for the random 
variable and its values) with 


•E(B i ) = k y' + 1 n,. tj , 

Var(B.) = k 2 o| + k 0 2 . i = . 


Now minimizing 


s 2 ^‘V-’-' -n ij V 2 

wit ^respect to u ■ and tj we obtain 


■b y = k 2 B. 


kry + 2 x 


'J V V = 


where X = v 

j j * n n n -i H 1 and T ' is t-h- 

containing the 1 t reatment j ,p rom ^ ^ ^ bl ° Ck t0talS 
U' + (r-x)f s (5.104) and (5.131) we get 


(5.131) 


(5.129) 


(5.130) 












mt' 
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k' 

* ^ 


From (5.131) - (5.132) we get 

£ = (r-^i *" 1 (T' -rb” 1 ! b.) 
fc j j i i 


(5.133) 


p Since ^ ®± 2 j • ^he interblock estimate of any treatment 




11 \ 3 contrast 

$ = £ c 1 t i ^ c i ~ ® is given by 

j j j j j 

<5. la J (-A )’ 1 J OjT-j . 


■ 128) ' by 


If 2r > b, it is easier to compute (t-X) *T and use $ 

= _ (r-X) * E c .T . The error sum of squares based on the 

J j 2 

observations is the minimum value of s in (5.130) and is given 


a *<3 the t ss f = Z B \ ' S B . - Z t T- 

1 1 J 




^Sents a 

B i' • 
e random 


l B 2 - (r-X) 1 I (T') 2 + v(fc r » ( E B.) 2 
i A j J b (v-k) . i 


(5.134) 


* 


as E T' = Z Z n. B. = k Z B., (v-1) X = r (k-1) . If b > v „ by 

j j j i 1 i 1 

the general theory of Chapter 3 


'2 SS f 


(5.135) 


V cT = - 

(5 129) f b—v ^ 

S- ’ fc is distributed as (b-V)" 1 o\ X*_ v and is independent of t. and 

4-- hence of *P. 

< 5 - 1301 i Theorem 5.11 . Under the model (5.126), the intrablock estimator 
J and the interblock estimator J of any contrast * = S c j t j are 

■ 

independent. 

. I Proof. We shall use wherever convenient the 




for the random vari 


table and its values. As the t. and V are 


constants * 


Cov(B. 


$ \ 

I s * 


k,Tj) = Cov( X' J, n i , j X i , J ) 

= E E n,, ,n. M Cov(X i;j ,» x i »j > 

JV 
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- * 1 n " , Cov(b. + e ,b., + e il;j ) 

i ' j ' J 

= I l n..,n.,.6. ,(a| +6 a 2 ) 

ij' i'J ii' B JJ 

1 J * 

- "ij * n ij ' <0 S +6 jj • ° 2) 



« 


f 

s 


= »ij< k °B + ° 2 > 


= 

ij k 


(5.136) | 


Since the a. and the e. . are independently distributed with 
1 2 1 ^ 2 

zero means and variances c and a respectively, using (5.135) 


= Cov(T .-k _1 Z 

J i' 

h i'j 

B i- 

= Cov (T B.) - 
i 

k Z 
i' 

Vj 

V -1 2 

“■ k n ij °f - 

,.-1 2 

k Z 
i' 

. -1 

n i'j 

= k n.j o f - 

k 

n ij 0 


as the B i are independently normally distributed with 

■C°V<VB t ,> = 4^, o| . 


(5.137% 1 

V? 

i 


Since (Q J ,B i ) have a Joint multivariate normal distribution from 

(S.137) the Qj and hence the t are independent of B. . Thus I, 

being only a function of the t . is independent of J function 
of the n J 


of the B A only). 


Q.E.D. 


Theorem 5 r l? var ($) = Z c 2 — 

j J .. r-X 


t i 



Proof. Since $ = Z —i- , 
a j r " A 

Var( *) = lr - M ' 2 J J, CjCj. C ov(Tj,Tj,) 

But 


(5.13 0 ) 





5 -l36) 


EXPERIMENTAL DESIGNS AND THEIR ANALYSIS 261 

Cov(T!,Tj,) = Cov(E n B. £ n n 1 

J J i ij 1 it "i*j• .> 

= ZZ n n. 6 _2 

i i i l' j 1 ii . CT f 


= £ nn.. =A _2 

i i f i i ^ j , , G - 


x ij‘ "f " A jj« a f • 

From (5.137) and (5.138) , since Sc = 0 we get 

j J 


2 

* 2 

VarW) = I -i- . 

j ^ A 


(5.139) 


Q. E. D • 


5.1371 


re 

w— « / 


= IrzAL 
2 


2 _2 


(5.140) 


where a f = k a g + k a . Since for any contrast \p= Set. its 
intrablock and interblock estimators \p and $ are statistically 
independent, the linear estimator 


ip* = 


A A 

03^+0) • Uj 


(5.141) 


is the best (minimum variance) unbiased estimator of if we 

know a and a£ we could obtain ^*. In general they are unknown. 
The original method of Yates (1940) is to use but with the 
weights obtained by estimating a and from the data. The 
usual estimate of w is obtained from the error of the BIB design 
(Table 5.9) and is given by 


in _ (bk-b-v + 1) r 


(5.142) 


Though one can estimate w* by (see (5.135)) 
ui = jr-1) (b-v) 


w 

jjSl \ 


The usual choice of co• (Yates (1940)) is given by 


where 5? is a n unbiased estimate of o\ obtained through the mean 
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sum of squares due to blocks eliminating treatments. This ^ 

obtained from the following analysis of variance table for 

testing H : all a. =0 (see Table 5.9 for notation) . 

3 a 1 



Table 5.10.Test of H a for flIB design 



i 


S 


Blocks 

(eliminating 

treatments) 


b-1 


U 2 (by 

suotraction) 


u. 


bk-b-v+1 “9 

~^rr- q- 


Error 

(intrablock) 



aeassSLSUi. under the model (s. 100 ) 

W /rt«« #• . a 


(5.143) 



E (SS (bl.adj) = (b-l) a 2 + ( bk . v) a 2 

KT.i_ _ B 


(5.144) i 


fi2te- In what follows we shall !L e th 
estimator and its corresponding estimate!*" 16 n ° tati '° n f ° r the 
Proof - As gg(bl-ad-f^ 

chisquare with b-l degrees “ diStributed as a noncentral 
X < sa y), we get r eedom and noncentrality parametei 

EJSS (bl-arH \ \ 

(5.145) 


i. 


' 



So X is 

noncentrality paraneter of th 
P = ssjbl^dj, statistic 


I 

■ 

I ’ 


te8tin g H a; ail ' 

i 0. Since for test 


ing H a # the P ower 


of 
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V 1 : a t nd ° 2 ’ we - nclud " U ^ C y a -— except 

p‘ and t j' J 1 .. So to evalu t * ° not depend on 

Without any loss of generality t„u * E (sstbl .aaj)) we can 


Without any loss of generality take ^ * (SS(bl * ad J)) we can 
Now under the model <5.l 26) the _ ' %i = ° £ ° r aU U 

conditional expected value of ss (bl ^ eS | XQn (5al45) is the 
since this depends only on a 2 and a ^ /0 9iven the and 

the unconditional expected value of SMbT'^sw 2 ° nClUde that 
taking the expected value of the riaht h 1 ° btalned by 

respect to the joint distribution of the a. which a>e 
independently normally distributed with mean O' and'the same 
variance o . will depend on o 2 and a 2 , under the assumptions 
|f= o, tj = 0 for all j, the treatment totals^., the adjusted 
treatment totals and the block totals B i all have nean 0. 

Hence, from (5.144), we get 

E(SS (bl.adj)) = bk-^VarfB^ + k x"" 1 Var (Q^ ) 

- r""\> Var(T^) . (5.146) 

Using (5.117) and (5.126) we get 


Var (B 


i 2 ,22. K 2 

.) = Of = k ° B . Ka ' 


2 2 

Var (Tj) = r (a B ,+a ) 


. r x r (k-1) a' 

Var(Qj) = Var (retj) - 1 k 

From (5.146) - (5. 14 7) 

E(SS (bl .adj)) = (bk-v) o B + (b-l)o 

2 

Hence the unbiased estimate ° B ^ 

A 2 . -1 / ^1 .adjj. _ "(bk-b-v+iT ^ - 

0 ^ = (b-1) (bk-v) ( " b^l 
° 2 

Thus the usual unbiased estim 

*2 1 .x (bl.adj)--nj^.b-v+TT ^ 

* f 2 = k(bk-v)" 1 Ck (b-1) —bTl ^ 

Hence 


(5.147) 


Q.E.D. 


(5.148 


(5.14 




264 ANALYSIS OF VARIANCE 


AA A ft 


u)^+uj 1 4i 

, the contrast * using the intrabl oc)t 
is the required estimate This is essentially what y at 

and.the interblock informa i ^ (1960 ) have shown that J* ls 

« M »> aia. - - a b tri h nd Weeks (1959, have shown that *. 
unbiased for * “ d * sufficient statistics. Rao (1947) has 

te^ed n the’»*i»u» likelihood estimates, though Sprott ( 1956) has 
Shown that this is not true in general for all incomplete block 

designs. 


ZJ 

From (5.150), using (5.110) and (5.133), we get 


= ic.t,* 
* j j j 


where 


oi tj+(*)'tj 

' " /\ /V 

W+£l' 1 

A ^ j ^ _1 

w — + w'(r- A) P. 


(5.151 


A 

k Q j + P P j 
r (k-1) +\ +£ (r-A ) 

where 

P 4 = T' - rb ’ 1 ZB., 
j J .1 


- rk e ^ 1 
(r- A) w 


k a£+a 2 


v(r-l) [(bk-b-v+l )' 1 q ] 

. k SS(bl.adj)-( v -k)(bk-b- v +l)“ 

r © p is the Yates estimate of 


(5.15 2 J' 




(5.153), 


£ 




V 

V\ 
y \ 

C l(n % ^ 

, N ’’ ■ 

Sm 

,\y 

\ s 


From (5-141) 


Var (<|J*) = 2 C, ( oi-t-co *) ~ 1 

J J 


Thus 


Var (t*-t* ,) = 

J j w+w' 


r (k-1) + A +p (r-X) 

Statistical inferences about the t^ are made by assuming 1 
u)=w, a)'=(!)' and p=p. In other words for all C. with I C. 
is normally distributed with mean \b and with variance"* gi 1 
(5.154) with o)=w, w'=w' . 


Example 5.7 . An experiment with 13 different variet 
Paddy (rice crop) was conducted at the Agricultural Resea 
Institute, Rajendra Naaar, A.P.. India to find the best v 
suitable for the tr 
A= 1 was adop 
with yield pe 
number inside 


T 


.0 

1 







mjfljff 
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34.4 

(3) 

38.2 

(1) 

28.7 

( 2 ) 

36.6 

( 1 ) 

31.8 

( 1 ) 

30.3 

12 ) 


32.4 

(5) 

32.9 

( 2 ) 

30.7 
(4) 

31.1 

(4) 

33.7 

( 6 ) 

31.5 

( 6 ) 


33.3 
(7) 

37.3 
(3) 

26.9 

(9) 

31.1 

(7) 

27.8 

( 8 ) 

39.3 
(7) 


36.9 

(13) 

31.3 

( 10 ) 

35.3 
(13) 

28.4 

( 11 ) 

41.1 

(13) 

26.7 

( 12 ) 


Table 5.12, Intrablock and combined estimates 


Treatments k T^ 


k Q. Intrablock Combined 

^ estimates estimates 

kQ .+.4517 P 


14.3551 


565.2 523.3 

476.8 496.4 

480.0 474.3 

448.8 470.6 

491.6 489.3 


449.2 

508.4 

508.8 

449.6 

450.8 

358.8 

446.4 

559.6 


484.0 

509.1 
457.6 

459.5 

473.6 

427.1 
442.4 
486.8 


41.9 

-19.6 

.5.7 

- 21.8 

2.3 

-34.8 

-0.7 

51.2 

-9.9 

- 22.8 

-68.3 

4.0 

72.8 


.805 

-.376 


-.419 
.044 
-.669 
-.013 
.984 
-.190 
-.4 38 
-1.313 
.076 
1.400 


4.39 3 
-.738 
.329 
-1.763 
.565 
-2.187 


2.973 
-1.223 
-1.678 
-6.311 
-.79 3 
5.397 
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Table 5.13. Intr a blo ck analysis of variancfi 


Sources o3 
variation 


f Blocks 
' (ignoring 
Ij treatments) 

V Treatments 
{eliminating 

blocks) 

f 

Error 

(intrablock) 

Total 


d.f. 


Sum of squares 


689.38 


328.55 


538.22 


1556.15 


Mean sum 
of squares 


27.38 


19.93 


1.374 


The E-value Is not significant at the level of significance a <.10 
The estimate" of the intrablock variance of t.-t. f , j ^ j •• ± s 
Var (tj-tj,) = 12.27 

SS(bl.adj) = 689.38 - 542.66 + 328.55 = 475.27 
3 = .4517. 

The combined estimate of treatment effects with recovery of 
interblock information is 


x 

A A kQ. 

, ‘J = — 


.4517 P. 


14.3551 


A A 

The estimate of the variance of t*-t*, , for j ^ j ', is 

A /V ^3 

Var(t*-t*,) = 11.1049 . 

5*4.5. Randomization Test in BIB Designs : A BIS design is a plan 
of a statistical experiment where v treatments are assigned to 
^perimental plots arranged in b blocks of k plots each such that 
^ch treatment occurs exactly once in each of the r blocks and 
pair of treatments occurs together in v blocks. The treat- 
mei >t arrangement in the corresponding plan follows exactly the 
same rule. For clarity we shall call the blocks of the plan as 
9 t oup S to distinguish them from experimental blocks. The 
Ca ndomization procedure in a BIB design can now be described as 
f01 lows t 


R : the b groups - 
1 blocks. 

experimental bloc x are distributed at ra 

R, = the * “^Tthe experimental blocks and that i s * 

to the k plots of th block. » 

.enarately and independently tor 

of the randomized block designs and the Ut | 
As in the case ^ investig ate the extent to which 

square ^, bution of the statistic U = Q/lQ^Qj). w hete 

rT Ire°rLdom variables corresponding to g *3 ^ S.i. 

32* u 3 . 4 .t,« distribution of U for all 

,a n be US ed to approximate the distriou 

nts under (R., R 9 ) when the hypothesis H ( all 

oossible arrangements unaer \ y 2 1 

treatments are equal) holds. It may be remarked that this c out5e 

Ls only followed when the normality assumptions in <5.100)are i B 

ioubt and the hypothesis H T is tested by using the randomization 

rest. To be exact, one needs to compute the value of U for all 

possible arrangements of the treatments to the experimental plots,; 

;f the value of U, obtained for the layout actually used in the 

jxperiment is exceeded in ot percent of the cases we reject 

it the level of significance a. Of course it is a very 

.aborious job and we naturally look for approximations. 

% 

Under the model (5.100)/ the statistic U under H_ has central 
>eta distribution with parameter (*s(v-l), ^(bk-b-vH) )• Thus 
mder 

'v 

:(u) - fer < I 


far (U) = (bk-b-v+1) 

[b(k-l)] ^[b(k-l)+2] 


= 2Jv-l) (bk-b-u+n 
(b (k—1)) 3 

= 2vr(Xv-k) 

v 2 b 2 (k-i) 


if b(k-l)>> 2/ 


litra (I960) has shown 

in< * under h under the randomization model ^ 




_ jV-1). 
E(0 “ b(k-l) 


l ' h O 

3 • ’•'w. 


where 


2r (Av-k) 
k 2 b 2 (k-l) 




(s i - i) 2 /s 2 


(5.157) 


\ (all 

1S Co ^s e 
0) ^r e in 

f or all 

tal Plots, 

in the 

set H_ 

T [ 


central 

Thus 


= Ux i4 - ri ) 


b 

s = E 
i=l 


5 -l 


When H t is true, while the mean of u under (R ,r ) agrees 
perfectly with the mean of U under the model (5.100), the variance 
of U would also be practically equal if b(k-l)» 2 and the block 
errors are homogeneous (6=0) . Here the test based on U 
(equivalently the analysis of variance F-test) works reasonably 
well as an approximation to the randomization test. Heterogeneity 
in block errors reduces to variance of U. Hence if 6 is large 
-J>e approximation by the beta distribution for testing H^ would 
jenerally underestimate the significance of departure from H T 
ind would thus be somewhat conservative. 

|> This situation ±s similar to that of the randomized block. 

f 

design and in some sense possibly better. When H^ holds the ratio 
of the variance of U under (R^,R 2 ) to that of the corresponding 
normal theory beta distribution is approximately equal to 
1 - 6/(b-l) in a BIB design and to 1 - 6/(r-l) in a randomized 

block design. In BIB designs b > r and in many cases b is large 
compared to r. 

The types of departure from H T that can possibly be detected 
by the normal theory beta test (or equivalently the P-test) have 
been examined by Rao (1959) through a comparison of the nonnul 
expected values of the treatment or errer mean squares. 

C . — Kc-i^noPd BIB designs) : 

♦ 4 -6. Youden Square D esigns. (£ --- - ^ possible to arrange 

the case ° f a BIB deSign “ ith b “/'tL blocks in such a way 
the b blocks and the treatments withi 
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lete latin square design. In Exampi e 5 | 

that they form an inC forms an incomplete latin square d e 

following arrangement terms •»*. 

j 

Blocks 



Smith and Hartley (1948) have shown that for all incomplete block 
designs with b = v such an incomplete latin square arrangement 
always exists. These incomplete latin square designs are called 
Youden square designs after Youden (1940) who first discovered 
them. In other words a Youden square is a latin square with 
several rows removed, the remainder having the property that its 
columns form another BIB design. Cochran and Cox (19 50) obtained 
interesting designs by combining Youden squares with latin squares 
when the number of treatments is small. 

If we treat the position in a block in the BIB design as a 
third factor then a BIB design is an incomplete 3-way layout, the 
three factors being blocks at b levels, treatments at v levels and 
positions at k levels. It may be desirable to balance out the 

case^rather ^ P ° Sitlon eff ect (if possible) in such 

“run" rand0mi2in9 ifc ° Ut it increases the error 

If we impose the re^ri 

equivalently r = km) that each treat 3 " = 

(m times) in each position th 3ppears e< I uall y often 

another BIB design which we shin” 6 ° btain * rearran < 3 eme nt into 
design. Obviously an m ™ Cal1 an m -P°sition balanced BIB 

3 Y ° Uden square design.',, * ^ balanced BIB design with m - 1 is 

“towed that for any B IB desic! 7, ■ Shrikhande and Taylor (1953) 
rearrangement leading **»« *> = » there always exists a 

“-position balanced design. 
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The linear model assumed fnr •t-v , 

1 ^umea tor the analysis of a Youden square 


sign is 


.= U + a . + t +3 0 +e 

i* i j * i 


(5.158) 


where x ±jSi , i 1, . .. ,b, j _ i, . . . , v , z = i, ... , k is the 
observation on the jth treatment in the ith column (block) and Ath 
row (position) and y is the grand mean, i s the ith column 

effect# tj is the jth treatment effect, i s the ith position 
effect and the errors e_^ are independently normally distributed 
with means 0 and common variance a 2 (and Z a. = 0, £8 = 0, 

i i l ^ 

£ t. = 0) . If the interblock information is to be utilized to 
J J ^ 

estimate the treatment effects then it is assumed that the cc ■ are 

independently normally distributed with mean 0 and common variance 
2 

and are independent of the e^^. 

The solution has already been obtained (in the BIB design) 

because the treatments are orthogonal to rows. The estimate is 

given in (5.152). The intrablock analysis is similar to that of 

the BIB design except that from the error sum of squares we 

subtract the sum of squares for rows with k-1 degrees of freedom 

to get a new error sum of squares wirh bk-v-b-k+2 degrees of 

-12 

freedom. The sum of squares for rows is given by v £ p -C.F., 

p P 

where p is the total of the observations in the pth row 
P 

position. 

In the setting of two-way heterogeneity (with additive 
effects model) with k rows/ b columns and v treatments, we say 
that a design 'd 1 is a "generalized Youden square" if it is a 
balanced block design when rows are considered to be blocks and 
also when columns are considered to be blocks. A generalized 
Youden square design is called regular if at least one 
Vv or b/v is an integer. Kiefer (1958) has proved that 
r egular generalized Youden square design is A-, D-, E , and 
^-optimum. This implies that Youden square designs 
«- -d L-optimum. If b - k - v, then a generalized Youden square 

design is a Latin square design. 
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s.4.7. uith ° ut . recov « 


. 4 . 7 . analysis - _ f btb design without recovery 

discussed the covariance an ^ ^ notations of (5.100) the model 
Of interblock information. des ign with one ooncomi tailt 

for the covariance analysis of 
variable can be written as 

J (5.15 

x = M »i * l ( * 7 z iJ + e U 

lJ J , _f covariance for testing H : all 

where (i.j)cs. The analysis of 


t =o is given in Table 5.14. 

« onoiveic of BIB designs without 
Table 5.14. int erblock' information_ 


- Sum of products Adjusted S.S 

Sources of j if v ■ j f b o 

° “ v7 zz d.r. i.o 

variation 


Blocks 

(ignoring 

treatment) 

Treatments 

b-1 

B xx 

B xz 

B 

zz 



f-1 

(eliminating 

blocks) 

v-1 

T 

XX 

T 

XZ 

T 

zz 

v-1 

q 4 q 3 

v-1 

E 2 

Error 

f= 

E 

E 

E 

f-1 

q_=£ - 

XZ 

(intrablock) 

Treatment 

bk-b-v+1 

XX 

XZ 

zz 


^3 xx 

E 

zz 

K 2 

(el .blocks) 
Error + 

f+ V-1 

W xx 

W 

XZ 

W 

zz 



XZ 

W 

zz 


where B 


= k I B i (x)B i (z) - b _i k~' L G(x)G (z) , 
* (re) ' 1 z Qj(*)Qj(z), 


XZ ’ (i.J)es X ij Z ij- k ' i l ^WBiW-Ire )- 1 l Qj (xJQ., 

B i(k) = E x. . B ( , _ ^ 

3 1 j 1 i z ij- G(x) = £ 

G < z >= 1 3 (i,j)e S ^ 

(i>J)eS iJ ' 


j (x) - Qj, the adj usted i 


variables x 


Jth treatment 


total with respect to the 
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'*7 With r-o. 

®zz are the va l“es Of B w , espect to the variables z B 

T E , etc xz nen z = x and v - i J' xx ' 

a xx' xx' ■ tc - na x - 2# and similarly for 

The covariance anal ysis of v 

similarly treated. Covariance a ° Uden SqUare flesigns oan be 

recovery of interblock theSe desit Jns with 

detailed treatment we „ 100 P ° se seV6ral problems. For a 

refer to Federer (1963) 

5.5. PARTIALLY BALANCED INCOMPLETE BLOCK (P BIB) DESIGNS 

bib desi9ns in the i0 n. a,^ 

equiDiock-sized, equireDlirat-^ . y 

o -u incomplete block designs ^he BIB 

designs have several nnHm,™ y 

optimum properties. Unfortunately BIB , 

esigns o not exist for all combinations of parameters that we 

might wish to use. Since X = r(k-l)/(v-l) is an integer, if V -1 

is a prime number, r has to be a multiple of v-1 and there is a 

further restriction that b = rv/k has to be an integer. 


The main characteristic of a BIB design is that Var(t —t ) 

j j 

has the same value for all j ^ j 1 . An alternative to BIB designs 
is to find designs in which this equal variance requirement is 
modified somewhat. Such designs were introduced by Bose and Nair 
(19 39) and are called partially balanced incomplete block PBIB 
designs. Their idea has been developed considerably since then 
mainly by Bose and his associates among others. Bose and 

Schimamoto (19 52) introduced the concept of partially balanced 

_ j _ v,*--acod the definition of PBIB designs 

association scheme and rephrased tne 

in terms of this concept. 


-• -z.. c f. Partially balanced association. Given a set of 

imition 5.6. paruiux y 

--. . x , v , a relationship satisfying tne 

iatments (symbols) . partially balanced 

.lowing three conditions is called P 

iociation scheme with m associate c 

set are either 1st, 2nd,..., 

Any two treatments in t e ae;c , 0 ciation is symmetrical, 

a t h e relation of a f the treatment 

i associates and the ^ ith associate of the 

it is, if the treatment a is e treatment 

4 th associate oi 

then 6 is also the itn 


a. 
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27 * * . set has exactly n ± treatments in the 

(ii) Each treatment tn^o^te ^ the number H does not depen, 
. are its. it 


set which are its. ... 

on the treatment chosen. 


3 n the treatmen ^ ^ arg associates, then the 

(iiil If two treat ^^ S a “; both jth associate of a and kth 

number of treatments .w ^ , ndepepden t of the pair of ith 

associate of 3 lS p jk = i _ T h e numbers v , n i' 

associates chosen. ^Jence parameters of the m-assoeiate 

partially balanced scheme. 

5 7 A PBIB design with m associate classes is an 

treatme :r e part ially^alanced^associat ion scheme 

according to an m-assocrate partially o 
such that: 

a) every treatment occurs at most once in a block, 

b) * every treatment occurs exactly in r blocks, 

c) if two treatments are ith associate, then they occu^ together 
exactly in exactly X. blocks, the number X ± being independent of 
the particular pair of ith associate chosen. It is not necessary 
that the X ± should all be different and some of the X ± may be 0. 

The parameters v,b,r,k,X,X^, n^, ..., n m are called by 
Bose and Nair the parameters of the first kind and the are 

parameters of the second kind. 


Obviously 
bk = v r . 


(5.160) 


Since with respect to each treatment, the remaining v_l treatments 
are classified as the 1st, 2nd, ...,mth associates and each 
treatment has n^ ith associates, we have 


E n. = v_i . i 
1 1 

Consider the r blocks in which 


(5 . 161 ) 


Consider the r blocks in which a particular treatment v occurs. 

In these r blocks we can form r(k-l) pairs of treatments each 
having a as one of its members. Among these pairs the ith 
aeociate of a must occur X times and there are n. ith assocl* te 
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r 

p 

Hence 


? n A. * r (k-1) . (5.162) 

i 1 

1 

Le t the treatments a, B be ith associates. The kth associate of 

a x = should contain all the n. jth associates of $(j?*i) • 

' j 

When j = i# a itself will be one of the jth‘associates of 6. 

Hence the kth associates of a, k = 1, ...,m should contain all the 

( n ,_1) jth associates of &. Thus 


m 

I 

k=l 




6 


ij ' 


(5.163) 


where 6^ * 1 if i = j and is 0 otherwise. 

Let G. denote the set of^ith associates, i = 1, ..,.,m of a . 

1 i • ' 

p or i jL j, each treatment in G. has exactly p.. kth associates in 

G.. Similarly each treatment in Gj has exactly p^ kth associates 

in g. . Thus the number of pairs of kth associates that can be 
i ' 

obtained by taking one treatment from G i and another from G^ is 

on the one hand n^pj^ and * on the other hand n^p^ . 

Hence 


n.p* 

i*jk 


n jP±k = n k 



(5.164) 


Among all PBXB designs, the two-associate PBIB designs are most 
popular in practical applications and most research works, carried 
out so far, have been concerned with two-associate PBIB designs. 

We shall be concerned here with two-associate PBIB designs. For 
construction and other relevant results of PBIB designs we refer 
to Raghava Rao (1971) . Extensive collection of PBIB design with 
two-associate classes are given by Bose, Clatworthy and 
Shrikhande (1954). Bose and Schimamoto (1952) classified all the 
then known PBIB designs with two associate classes in the 
following five types depending on the association scheme. 

(1 ) Group divisible (or divisible partial designs by Dembowski 
‘1968)), (2) Simple, (3) Triangular, (4) 1-tin square type and 
^5) Cvel i 
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munition 5.8 . 'A PBIB design with two associate classes is call 
group divisible (GD) if v = mn, where m, n are integers and if 
these treatments can be divrded into m groups of n treatments 
each such that any two treatments of the same group are first 
associates and any two treatments of different groups are second 


associates. 

For a GD 
n 2 = n(m-l) , 


design the parameters are v = mn, n 1 = n-1. 


p 1 = (phi = < n " 2, 0 

1J 0, • n(m-l) 


), p 2 = < 0 


1J 0 , • n(m-l) n- 1 , n(m- 2 ) 

For a GD design r >Ap rk - A 2 v 0. The GD designs can be 
divided into 3 classes (Bose and Connor (1952)), singular designs 

(r = Ajj, semiregular designs (r > \ 1 and rk = * 2 v ) and regular 
designs (r > \ 1 and rk > \ 2 V ) • 

Example 5.8 . The following blocks v 

(15,9,1,13), (5,7,8, 1 ) , (10,1, 14, 2) , (15,11,2,13), (6,15,4,7), 

(12,4,3,1), (12,14,15,8), (6,3,14,15), (5,4,2,13), (10,12,13,6), 

(9.7.10.3) , ( 8 , 6 , 2 ,9), (5,9,11,12), (7,13,14,11), (10,4,8,11) , 

constitute a GD PBIB design with v = b = 15 r-k -4 j , 

A i = °- pl =«- p2 » <°; |). ' 2 = 1 ' 

Example 5.9 . The following blocks 

(1.2.3) , ( 3 , 4 , 5 ), (2,5,6), (1,2,4), (3,4,6), ( 1,5 6 )~ 

orm a GD PBIB designs with parameters v= b = 6 r k - 3 

\ = 2, A 2 = 1 , pi = (0, 0, 2 = 0, 1 t, r _ k - 3, 

( 1 ,2), (3,4), ( 5 , 6 ) . °' 4 ' l l, 2 '* The three groups are 

Ss finltion 5 9 . a two-associate PBIB • 

x 2 / 0, or X ^ 0 X - n BIB d lgn with either X, = 0, 

1 ' 2 - 0 w called Simple. 1 

A simple PBIB design may be a i.„ 

GD PB1B design given in Example 5 H °' ’ 

Ss finitior v „ p e 5.8 ls sim pl e . 

lf v ‘ "(n-1) and the as *°° late PBIB desl gn is called triangul? r 

the v tr eatments as elements 1 ^ tlon sc heme i s obtained by writing 
square sments above the . 

S’ ay ° f side ", leaving the nr' P * 1 diagonal of 3 

> principal diagonal blank and 
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£ symmetric array. below the aiagona , ^ 


^ symmetric: array. 

J> For v = lo, n = 


5 an{ 3 We have 


6 7 

8 9 


3 ° nal in or ^er to obtai, 


the follow itg arrangement. 


4 7 9 10 

F° r an ^ Vestment a, the first a^ 

occur in the same row or t-v, s °ciates are exactly those which 

associates of 1 are l2 3 ,Tl ??< ^ ** *»• 

t are (8,9,10). ' ' and the second associates of 1 


-7), f For this association scheme n^n-Z) , n 2 =Y<n-2) (n- 3 ) 

'J. 3 ' 61 ' j P 1 = (P-j) - < n ' 2 ' , n ' 3 ). P 2 - , p 2., . , 4 ' 2(t '- 4) 

l3 ^ » n “ 3 ' ^ (n-3) (n-4) 1J 2(n-4) ,h (n-4) (n-5) 

^ Definition 5.11. A PBIB design with two associate classes is 

called a latin square type L. (i > 2) scheme w^ith i constraints if 

2 1 2 
v = s satisfies the following association scheme: the s 

^in an s * s square array and i-2 mutually orthogonal 

/Jlfc^S^squares are superimposed. Two treatments are first 

associates if and only if they occur in the same row or the same 

>s are column of the array or in positions occupied by the same letter 

in any of the latin squares. 


-,=2 (n-2) , n^hin-2) (n-3) 


>s are 


= 0, 


1 1 n—2, n—3 / 

P = (p ij J = ( ) , p 2 = (p 2 .) = ( 

J n-3, ^ (n-3) (n-4) U V 


2(n-4) 


For s = 4, taking the array 


11 12 


r i ^ 


13 14 v 15 l 6 

t design with blocks 

Ye obtain the following 2 (13 ,14,15,16), 

(1,2,3,4), (5,6,7,8), <9 '“' (4,8,12,16)- 

,(1,5,9,13), (2,6,10,14),(3- 9 < 11 ' 15) ' 

We X x = 1 , A 2 = °- 
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Note. L 2 is also a simple 
discuss in Chapter 6. 

n.f<n<t<on 5.12. A non-GD PBIB design with two associate cl asses 
is called cyclic if the set of first associates of a treatment a 
is (a+d,, ...,s+d n ) modtvK where the integers d 1# ...,d n ^ sati s£y 

the following conditions; 

1. The d ± are all different and 0 < d ± < v / i = 1**..*^. 

2. Among the n^nj-1) differences., (d^-d^ ,) (mod(v)), each of the 

integer d^, ...,d n occurs pj^ "times and each of other positive 

1 2 

integers less than v occurs p i;L times. 

All the known cyclic association schemes have parameters 
v= 4s+l, n^^ = n 2 = 2s, 


lattice design. This we shall 


p 1 = < s ; x ; : > . 


P 2 - ( s 


' s ) 
. s-1 ' 


Example 5.10 . The following PBIB design with two associate 
classes with blocks (treatments being numbered as 0,1,2, 3,4) 

(0,1,2), (1,2,3), (2,3,4), (3,4,0), (4,0,1), 

is a cyclic design with s = 1, x\ 1 = n 2 = 2, \ ± = 2, A 1 = 1, 

d l = 1# d 2 = 4 * The difference dj - d. , are 4-1 = 3, 1-4 = 2. 

5 * 5 «1» Intrab lock Analysis of Two Associate PBIB Designs : We 
shall consider the intrablock analysis of a PBIB design with 
parameters v,b,r,k, X y X 2 , n 1 ,n 2 , p* i,j,k = 1,2, under the 
normal theory fixed effect model. This is obviously more 
complicated than that of BIB designs. Let x. . denote the 
observations on the jth treatment in the itl/block provided that 
the Jth treatment occurs in the ith block. Let N = (n ) be the 
incident matrix of the P BIB design <n. be the nunfcer of tines 
b . J treatment occurs in the ith block) . The P B I B design is 

(1 7Z "T implyin ® that "ij = loro. Let S be the set of 
variables = 1 l6t US assume that the random 

distributedMii-fc P ond ing to x^) are independently normally 


[ g(X 


'ij 


j = y + a ± + t 
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Ivar j 
i or 


* 0< 


*ij 


= p + on + t + e 


ij 


(5.165) 


where (i, j) eS, the e ± , are independently 

an 0 and variance a ? , the a i e Y nornali Y distributed with 

“• - *■ “•» snsiTrrr 

" e are interested in testing the null hypothesis H . aU \ J - „ 

^ hyP ° theSiS H «* 311 “i " «». - practical 3 significance 

in most cases, can be similarly trpst^ T 
1 y trea ted. In the notation of 

Section 3.1 the hypothesis H T is a linear hypothesis with r = v-lg- 
= v+b-l.H-s = bk-b-u+1. The least squares estimate of y ,c,t. 


s 


under fi is obtained by minimizing s 2 in (5.101) and we obtain 1 the 

l _ r~ / c i . 2 


-/ "X- WWWWiU 

set of Equations (5.102) by minimizing s 2 with respect to these 
parameters. The solution of these equations in the matrix 
notation is given in (5.122). From (5.122) and (5.124) the 
estimate of the treatment effects t ^ for a general incomplete 

a i n 1 « ! 1 _ _ 1- 1. w. _ w. _ w. J _ A . 


block design with k^ = ... = k fa = k, r^ = ... = r y = r is Q_ = C 
where C = (rl(v)_ k _1 N'N). This can be written as, for (i,j)eS 


_ I 


n ij n ij ,t j '' J " 1 . V ’ 


(5.166 


k Q. = (rk-r)t. 

J J i J'(jVj) J 

Let S. (t.) be the sum of all estimates of treatments which are il 

associates of t^ . Then (5.166) reduces to 

/> ~ A (5 16 

K Qj = (rk-r)tj - A 1 s 1 ( fc j) - X 2 S 2 (t j' - 

t- _«+* r- 


Writing S^Q.) for the sum 
ith associates of t ., we obtain 


of adjusted treatment totals which an 


kS l(Q .) . . S.ftjJCrlk-l)- h pll - X 2 p 22 ] 

+ S 2 (tj) C-hPll 1 l 2 V 12' i ' ^ 

k S 2 (Q,) = . x,n 2 ^ + S X l *j) C ' ^ ' "f 22 

»?, - X 2 p 22 ] ‘ 


J 


2 vu j' ~ - A 2“2 w j 

+ SJtJ [r(K-l) - x l p l2 


(5.168 


|rt> 
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-5- 



Since 




It. = t . + S 1 (t ) + S (t .) = 0/ 
j J J 1 J Z 2 

from (5.167) - (5.169) we obtain 

A A 

k Qj = [rk-r+X 2 ]tj + (*2 -X l^ S l^j^' 


(5. 


169) 


■ ■ 

i 


k SjtQj) =U lP ^ + 1 2 P 2 12 - 


+[r (k-1) + ^(pfj-P^) + A 2 (p2 2 -pl 2 ) ]S 1 (t’ j ) 


^ j 


= C (X 2 -X 1 )p 12 ]t i . +Cr(k-1)+X 2 +( X 2 -X 1 ) (P^-P^!) (t^) 


(5.170) 


&&i firoiri (5.163) / ^ P±2 — frj^—1/ ^11 P 12 — ^1 

From (5.170) we obtain 


t < = i (B 22 k Q < - B ’* k S ' (Q <>> ' 


J 

where 


12 ^l vw j 

A 12 = r ^ k - 1 ^ + X 2' B 12 = X 2 ” X l* A 22 = * X 2“ X 1^ P 12' 
B 22 = r(k ” 1) + X 2 + (X 2“ A 1 )(p ll _p ll ) ' 


(5.171) 


A A 12 B 22 ~ A 22 B 12* 


Alternatively, if n 2 < n^ it is convenient for computational 
purpose to eliminate S 1 (t j .) and obtain 


= T (B 21 k Q j - 
where 

B H k S 2 (Qj)) , 

( 5 . 173 ) 

A H = r ( k ~l) + x^. 

B 11 = (X r X 2 )# A 21 = (X l" X 2 )p 12' 


B 21 = r t k-1 ) + x i 

+ ( X 1~ X 2 ) ( p 22 “^ 22 ^ ' 



" A 11 B 21 ” A ^ B 


2111 * 


ftom (5.106), the error 
is given by 


(5.174) 

sum of squares for testing H„ in the ?BlB 




(5.172) f 


I 


*■7 
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13 m \ui)ts " C ' F ’ 3 ' 5 * B* - C.F.: 

= Ctotal S - S - :1 ■ [treatment S.S. (eliminating blocks) ] 

- [block S.S. (ignoring treatments)], 

rthere t, is given by (5.171) or (5.173). The minimum value of s 2 
under is equal to 

[Total S .S . ] - [treatment S.S. (eliminating blocks) ]: 

-fhe best invariant test of H T rejects H T at level a whenever 

A 


, E t .Q . 

hk-b-V+1 ( J 3 \ ^ v 

—' q-j — 1-a, v-l,bk-v-b+l 


(5.175 


The variance of the estimator t^-t . , of the elementary treatment 
contrast t.-t., (based on intrablock information) is given by 

(from (5.171)) 


A S' 


2 i' 1 k(B„+B. 2 )o 2 . if J and r are- first 
• . _ i 22 ^2 associates» 

Varltj-t., - -I ^ ^ i and y a - re second 

L 22 associates. (5.17 

3 • .4= variance of a 2 -associate PBIB design 

The intrablock analysis 

^is given in Table 5.15. 

I - 

rfi-—-- f 


Sources of 
variation 

Blocks 

(ignoring 

treatments) 


d.f. 


b-1 


Treatments . 

(eliminating V '" L 
blocks) 


Sum of squares 


k- 1 i b: - C * F 

i 


q 2 = f t j Q j 


hk -b-v+ 1 —2 
v— 1 q 3 


Error 

Total 


bk-b-v+1 


bk-1 


E * 


2 _C.F. 


1 ill 2 

c.f. = jj x ij 3 
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If the F- value is significant, using (5.176) we can find 
which of the elementary contrast in the treatment effects ar e 
significantly different from 0. 


5.5.2. Missing Observati ons in PBIB Design.: One can treat the 
case of one missing observation in the two-associate class PBI B 
design in the same way as the BIB design in 5.4.2. Giri (1957) 
discussed the analysis of several missing observations when the 
missing observations follow a particular pattern. In this 
context he has defined a new class of design which he called 
"reinforced PBIB designs". 



5.5.3. The Efficiency of the Two-associate PBIB Design ; From 
(5.176) the average variance of the elementary treatment 
contrasts, using intrablock information only, is 

2 (v- 1) _1 [n x kA* 1 (B 22 +B l2 ) + n 2 kA _ 1 B 22 ] a 2 (5.177) 

If we have conducted the experiment in the randomized block design 
with the same number of treatments v, the same number of 
replication r for each treatment and the same a 2 , then the 
variance of the unbiased estimator of any elementary treatment 
contrast , is 2a 2 /f. So the efficiency of the PBIB design 

( 2 -associate class) relative to the randomized block design is 

_ (v-1) A _ _ (v-l kA 

rn l MB 22 +B 12 ' +rn 2 kB 22 rKr(v-i)B 22 +n 1 B 12 ] ' (5.178) 


5-5,4 - —° very of Intrablock I nformation : The recovery of 
interblock information in general incomplete block designs has 
been discussed in 5.4.4. As discussed there, the combined 
estimates suggested by Yates and the maximum likelihood estimates 

true fo^PBiR 6 ^ 1031 ln BIB designs ‘ But - this is not in general 

»atho2 give th nS ' SPr0tt <1956) haS Sh °™ that these two 
for g u IL Lr”! COntoined estinate treatment effects onl, 
Slble two “ asso ciate pbib designs. 

We shall assume the model (5 10 ^ * 
interblock information in pbib 1 \ the rec ° ver * of 

esigns. The intrablock estimates 
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£ treatment effects, obtained by minim- • 2 

b given in (5.131). The combined (intrabl^v 9 S (5 * 130) ' is 
£**-*• Of tj is Obtained b Y ““ *“*““«*> 

w l (i,j) £ es (Xi J - P - “i - tj) 2 +Wlk -l £ (B . ku ,_ £ . ,2 

v 1 1 i Ij J 

with respect to parameters « t- v 

WJ - M ' i'^j' where 


W< = ~~2 * W i = “2 - - -2 

1 ° a B + ka 


(5.179, 


are assumed to be known, so that if we combine (5.105) and (5.131) 
with appropriate weights, the combined estimates i* of the 
treatment effects t^ are given by 

k 1 W *1 Wj-W 1 - 

r [W t (M> + 2 X t * , 

lk K J 15 J' 

w; 


= W^Qj + Pj 9 j “ 1 
where P. = T'j -rb -1 E . writing 

i . 1 W'. M 1 - W 'l 

a = rCw^f-^-) + —® “ it 

I W' , 

■# v j ■ w i q j + — p j 

end comparing (5.180) with (5.166) we can write 

a Z* - BXjS^i'j)- -Vi- 1. V 

Following (5.168), we obtain from (5.182) 

A ^ 

- eX ini t*j + (a-$X 1 pi 1 - BA 2 P 12 )S l (t J ) 

o 2 x cs (t*) = SeCVj# 

+ (-BA^ -3A 2 Pi 2 ) S 2 (t j ) 

A ^ 

-BX,n 2 t* +(-6XjP2i -B^2 p 22 )Sl<t J 
4 4 J 

'• + (a-8X lP 2 -BXjPja 1 = S 2 (V J ) * 


(5.180) 


(5.181) 


write, as in (5.167) 


(5.18 2) 


(5.18 3) 


I - Syi 

5 ' :&&& 


$1 


• .!3 


’■pa 


‘ 


analysis of VARIANCE 


% +s (t * )+S 0 (t*.) = °' fr0rn ( 5 - l82) - (5 * 183)# We °bt ail 

Since t j + & l' j 2 J 


*12 ^ B i2 S l (t V " k V 

A 22 + B 22 S l (t V = kS l (V J > ' 


(5.184) 


where 


h l2 = k ( a+6X 2^ = rtW l +w ’ + (W 1 -W , 1 )^2- 


B{ 2 = k0(A 2 -A 1 ) = (A 2 -A 1 ) (V^-Wj) 


A 22 ~ keA 2 P 12 _k8A l P 12 ( W r W ‘l) (^2”*1^ P 12' 


(5.185) 


B 22 " k ” k8 ^l (Pil -p H^ “ k ^^2 ^ P 12 p 12^ 


1 2 


= rtv^ (k-1) +W* 1 ] + (W 1 -W' 1 ) [ A 2+ (A 2 ~A 1 ) (Pn“Pii) 3 
1 1 2 2 

as P n + P 12 = n 1 -l, p lx + p 12 = n 1 . From (5.184) , we obtain 


* j .= T [B 22 kV j - B l2 kS l< V j )] - J = 1 .V , 


(5.186) 


where A' Aj 2 B 22~ A 22 B 21 ' If n 2 < n i' computationally it is 
convenient to solve the equations (obtained from (5.183) by 


eliminating s 1 (t^)). 


A ii fc *j + B ii s 2 (t V = k v j' 

A 21 *=*, + B- 21 B 2 (i*) = kS 2 (V ), 


( 5 . 187 ) 


and obtain 


t) = CB^kV. - B'^S^V.)], j . !. v , 

where 


( 5 . 188 ) 


- rCWjIk-D+wn + ( W W . )A 

1 1 l' 1' B 11 = (Wj-W^) (X r X 2 ), 

= ( V w i> <V* 2 >Pi 2 . 

-r[w l (k. 1 , + w iJt(Ifi . Mi)[ +( _ 

= A 1 .. r i 12 p 22“ p 22 ■*' 




‘■v l * 


V 
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* 

It may be verified,from (5.186),that 


^ j“ t j ' ^ I A* ^ B 12 +B 22^ a2, if are first 


associates, 


3' a 2 
.A 1 22 ° 


if (j # j ') are second associates, 


(5.189) 

As in the case of BIB designs W.,wi are unknown. Their estimates 

A A II 

W l /W 'l are obtained from (5.142) and (5.143). The estimate 
p of p = is given in (5.153). Statistical inferences 

/V A 

about the t' . are made by assuming that W. = W., W'. = W' and 

/v J 1 1 I 

P=P • 


Example 5.11 . An experiment with 15 different varieties of 
wheat was conducted at Indian Agricultural Research Institute, 

New Delhi, India, to find the best variety suitable for the 
region. A group divisible PBIB design with v= b = 15, r = k = 4, 
Xl =1, X = 0, n x = 12, n 2 = 2, 


pl - (Pij> = < 9 2 ;o> 


p 2 = (p 2 j) = ( 


12, 0 

0, 1 


was adopted for the experimental layout. The layout of the 
design with yield per plot in kilograms is given in Table 5.16, 
the number inside brackets being the treatment number. 

Table 5.16. Yield of 15 different varieties of wheat 


Table 5.16. 

Blocks 


Yields with treatments 


1 

2 

3 

4 

5 


2.4 
(15) 

2.7 

(5) 

2.6 

( 10 ) 

3.4 
(15) 

4.1 

( 6 ) 

3.4 

( 12 ) 


2.5 

(9) 

2.8 

(7) 

2.8 

( 1 ) 

3.1 

(ID 

3.3 

(15) 

3.2 
(4) 


2.6 

( 1 ) 

2.4 

( 8 ) 

2.4 

(14) 

2.1 

( 2 ) 

3.3 

(4) 

2.8 

(3) 


2.0 

(13) 

2.7 

(1) 

2.2 

( 2 ) 

2.3 

(3) 

2.9 

(7) 

3.0 

(1) 


6 
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7 

3.2 

(12) 

2.5 

(14) 

2.4 

(15) 

2.6 

(8) 

8 

2.3 

(6) 

2.3 

(3) 

2.4 

(14) 

2.7 

(5) 

9 

2.8 

(5) 

2.8 

(4) 

2.6 

(2) 

2.5 

(13) 

10 

2.5 

(10) 

2.7 

(12) 

2.8 

(13) 

2.6 

(6) 

11 

2.6 

(9) 

2.6 

(7) 

2.3 

(10) 

2.4 

(3) 

12 

2.7 

(8) 

2.7 

(6) 

2.5 

(2) 

2.6 

(9) 

13 

3.0 

(5) 

3.6 

(9) 

3.2 

(ID 

3.2 

(12) 

14 

3.0 

2.8 

2.4 

2.5 

(7) 

(13) 

(14) 

(ID 

15 

2.4 

2.5 

3.2 

3.1 

(10) 

(4) 

(8) 

(11) 


From (5.173), for intrablock estimates 

A 11 = 13, B H = 1# A 2l = 2 ' S 21 = 14# A = 18 °* 

A k Q. - k S 2 (Q .) 

t j = -^180-' j = 1§ '**' V * 

* V . 

The combined estimates t ^ , from (5.188) , can be written as 
* (a+b) R . -b S (R ) 

t = _J ^ -3 

j a 2 +ab+2b 2 


where 


R j = K Qj + pPj# a = rk - (1-p) (r-1) , b = 1-p, 

P “ W x ' 

and S 2 (Rj) is the sum of the Rj of all treatments which ai 

second associates of t . 

J 

Calculation for com bined estimates . Treatment sum of 
squares (ignoring blocks) 

‘ r_1 * V - c - p - = 3.1363 . 
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rea 


tment 


j 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


Table 5.17. Intrablock estimates 
T' 


11.1 
9 .4 
9 .8 
11.8 
11.2 
11.7 
11.3 

10.9 
11.3 

9 .8 

11.9 

12.5 

10.1 

9.7 

11.5 


2nd kS 2 (Qj) 

associates _ 


42.5 

42.1 

42.9 

47.9 
44.0 
44 .4 

44.8 
43.0 

42.9 

41.7 

45.8 

46.7 
41.5 
41.1 

44.7 


1.9 

6, 11 

4.2 

-4.5 

CM 

H 

3.7 

-3.7 

8, 13 

-0.5 

r~ 

• 

o 

i 

9, 14 

0 

0 .8 

10,15 

-1.2 

2.4 

11,1 

3.7 

0 .4 

12, 2 

-1.2 

0 .6 

13, 3 

-4.8 

2.3 

14,4 

-3.0 

-2.5 

15, 5 

2.1 

1.8 

1,6 

4.3 

3.3 

2,7 

-4.1 

-1.1 

3,8 

-3.1 

-2.3 

4,9 

1.6 

1.3 

5, 10 

-1.7 


0.1244 
-0.3700 
-0.2850 
-0.0 544 
0.0688 
0.1661 
0.0377 
0.0733 
0.19 55 
- 0.2061 
0.1161 
0.2794 
_0 .0683 
-0.1877 
0.110 5 



Blocks 

(ignoring 1 ‘ 

treatments) 

Treatments 

(eliminating 1 

blocks) 

Error 31 

(intrablock) 


4 .9233 

0.3516 


1.5632 

0.1150 

1.328 

2.6878 

0 .0867 



■ 


a *>•, 
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. „ u , res (eliminating treatments) 

Block sum of square 

[Block sum of squares (Ignoring treatments)] - [treat, 

' “ squares (ignoring blocks) ] ♦ [treatment sum of Squar 

(eliminating blocks)] 

_ 4.9233 - 3 . 138.3 + 1.5632 
= 3.3482 . 

. 3.348 2 _ q 24 20 

Block mean square (adjusted) = 14 " ° * 

A - 

A W 1 4 5 ( 0 . 0867 ) — = 0.3099 , 

P = T “ =‘^6 ( 0.24 20)-11 ( 0.086 7 ) 


= 0.3099 , 


a = 13.9297, b = 0.6901, a 2 + ab + 2b 2 - 204.6019. 


14.6198 R, - 0.6901 S 2 .(R.) 

* _ J _ ■— 

j " 204.6019 


, j — 1» • • • i ^ • 


0.0519, if (j,j‘) are second associates, 

ar (t * -tf, ) = 1 

1 J ^0.0495, if (j, j ') are first associates. 

Table 5.19.. Estimates of t- 


Treatments j 


0.0877 
-0.3746 
-0.2782 
0.0457 
0.0681 
0.1728 
0.0567 
0 .0459 
0.154 3 
-0.2322 
0.160 3 
0.3158 
-0.1161 
0.2317 
0.1217 
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erred to Section 5.4 .5 The re ader is 


e £erred to Section 5.4.5 The r «*« is 

sed in this section. We shall investigate h "* *" 

whi ch the normal theory beta distribute th ® eXtent to 

Approximate the distribution of u Under *** be to 

ft. = 0 for tWo ass °ciate P B IB designs. tv 2 ***** V a11 

the first two moments of u under (R n ) ^ ^ by com P uti «g 

the corresponding moments of a central\ T* COIn P ar ing them with 

parameters (*s(v-l) , i* (bk-b-v+1)) . For With 

the first two moments of u under (r r ) SS ° Ciate PBIB designs ' 

,1963) for the case X. = 0 \ ‘T * *“ 

' A 1 u ' a 2 - 1/ by Gin (19 65) for the case 

h = °' H C1Sr °f (1969) for the general case.. In all cases 

E(U) - b (k-1) (v-1), which agrees with E (U) under the normal 

theory model (5.165) . The computation of Var (u) under (R 1# R 2 ) is 

much more complicated and no direct analytical comparison can be 

made with Var (U) under the model (5.165). Cleroux. (1969), with 

aid of computer # computed the ratio R of the variance of U under 

l l 

^^ 2 ) to the variance of U under the model (5.165) for nearly 
two hundred designs and made the following observations: 

(1) For group divisible design these two variances are 
Approximately equal, that is, 90 £ R i 1.10. 

^(2) For simple designs these two variances are approximately 
equal in some specific cases. 

(3) In all other cases they are unequal. 

• n raseS wh ere these two variances are unequal one 
However, m cases wnere 

can approximate the distribution of V ™ « r V 2 are T such 
central beta ran d o, variable are 

that the mean and the varia " ^ cleroU x (1969) for the 
equal respectively. ( We re of f ree dom in terms of the first 

Required expressions f° r the ^ 
two columns of U under R^V 

5.5.6. JXow^balance—^ 

enure class of PBIB designs «» > ^ = 


lie cia&& ‘ fti) b - mv, 

sndina on (i) b ~ v ' 
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. T are positive integers. Bose and K lshet) (w# 
where m > 1 an . lanclng in PBIB designs with b - v and 
developed the iv— wltho ut recovery of interblock inf 0tma 

discussed their ana extended this idea to the case b = Kv 

tion. Shrikhan e designs with b ^ mv for wh lch 

" -eU as b ,« - r d Gl ri (1»*> obtained a class of de st , 

row-balancing i P° rowbalancing is possible. As r ? mk, 

with b i* ■* _° treatnl ents within blocks so that every treatment U 

replicated * constant number of tines in each row is impossible. 
Z alternative is that some partially balanced arrangenent of the 
treatments within rows nay be possible. It nay be possible to 
have such arrangements for many PBIB designs but an analysis 
after eliminating row effects becomes straightforward if the 
following conditions are satisfied: 

(i) each treatment is replicated at least m times and 

atmost (m+1) times, ip each row? 

(ii) the (m+1) th replicate of the treatments (named as odd 

treatments) can be so arranged among the rows that the rows form 
d'PBIB design. 

Example 5.12 . Consider a PBIB design with v = 12, b=16, k=3, 
r=4, Aj=l, \^=0, n^=?8 # n 2 =3. 


>1 _ ( 4, 3 
' 3,0 


P 2 = 


— w • 

aving the following arrangement . 


Blocks 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 

2 5 7 10 4 1 6 3 11 8 12 9 10 7 4 1 


1 4 8 11 12 9 10 7 6 3 

3 6 9 12 8 11 2 5 7 10 


2 11 8 


The arrangement of the odd treatments in the rows are 








tf hich is 
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i rrr> wi +-"u _ 4,7 ’ 


ni>,u THEIR analyse - 

is the PBIB design with palters. v= 12 b , ‘ 

n l = 8, n 2 = 3 , X, = 0# x = , wl - 2 12 ' b = 3 ' r = 1 , 


% 


** — parameter: 

4 # n. = 8, n 0 = 3 x = n i 

1 2 1 u *. a 2 “ .1# P ,P 4 

Giri (19 57) has proved thaf 4.1, .. ,, 

Bose and Rair (U39 “ Wln5 ° f d “ i °“ 

With naramofnrn _# .% . 2 


fi- by —- — <»»,««, r? 

% = P* * = P -l, n = p(p-2 ), n = p _i x - 1 > V P ' 

2 f A i — 1, A 2 = 0/ 

/ v / 


,1 = 


> (p-3) , b-1 


P-1 



where p is a prime or a power of a prime number can be arranged 
satisfying the above two conditions. Example 5.12 is a special 
case of it when p = 4. The analysis with and without recovery of 
interblock information is quite straightforward and goes in the 
same way as the row-balanced BIB design in 5.4.6. The linear 
model assumed for. the analysis is given in (5.158). The reader 
is referred to Giri (1957) for details. 


EXERCISE 


1. An agricultural field experiment was conducted on cotton at 
ifthe Dry Farming Research Station at Dhanduka, India, with the 
idjgf Of finding out a suitable level of farm yard manure for a 

. _ ii\ nn farm vard 


_____ jr 1UUX ^ — 

r^Vi^ ield 0 f cotton. The treatments are, (1) no farB 

_ , Q .,-p^ ( 3 ) 31 


nanure feTls'korn-s of farm yard manure per hectare, (3) 30 
manure, \2) Vr^rtare ( 4 ) 75 tonnes of farm 

tonnes of farm yard manure per hectare, 

( 5 ) 150 tonnes of farm yai.u 

Wrd manure per hectar , manure per hectare. The 

hectare and ( 6 ) 300 tonnes vield of cotton per 

layout (completely randomized design) with yre 

Hot in kilograms is given bel treatment number, 

The number inside the brackets indica • hypothesis that the 
Analyse the.experi mental data to 
^eatment effects are eaual. 
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5.40 

(4) 

8.90 

( 6 ) 

5.85 

( 2 ) 

3.10 

( 1 ) 

5.05 

(3) 

2.25 

( 2 ) 

8 40 

4.50 

3\*0 

6.40 

3.60 

6.70 

(4) 

( 1 ) 

( 6 ) 

(5) 

( 2 ) 

(3) 

3. 20 

3.40 

4.45 

8.05 

6.70 

5.40 

(3) 

( 6 ) 

( 6 ) 

( 6 ) 

(4) 

(5) 

5.50 

5.80 

3.70 

4.50 

5.65 

4.30 

(4) 

(5) 

( 1 ) 

(3) 

( 1 ) 

( 2 ) _ 



2. Suppose that in a randomized block design with v treatments 
and r replications for each treatment the treatment i in the block 
j is interchanged with the treatment j in the block i. Give the 
analysis of variance of the resulting design. 

3 . In a randomized block design suppose that treatment 1 was 
used twice by ~mistake in the first two blocks. How do you 
propose to analyse the resulting experiment? 

4. Suppose that in a v*v Latin,square design two treatments in 
the same row (d»r column) ,cjot interchanged by error. Obtain the 
unbiased estimatot (if any) of different elementary treatment 
contrasts and thfeir, variances * 

5. Ah experiment in a Latin sqhare layout was conducted at the 
Agricultural Research Station, Marutern, Andhra Pradesh, India, 
to find out the different bulKy organic manures , each supplying 
40 kilograms of Nitrogen per hectare, on the yield of Paddy, 
treatments are, 

(1) farm yard manure, (2) Gl^ricidia, (3) farm waste compost, 
(4) night soil compost, (5) sannhemp compost. 

The actual experimental layout wi)th yield of paddy per plot in 
kilograms is given below. _, 


The 


12.5 

16.3 

14.0 

8.5 

io .2 

(3) 

($) 

( 1 ) 

(4). 

(3) 

14.5 

13.5 

- 7.9 

9.2 

11.5 

(5)' 

( 1 ) 

(4), 

( 2 ) 

(3) 

13.5 

10.5 

8.5 

13.6 

18.9 

( 1 ) 

(4) 

( 2 ) 

(3) 

(5) 

11.0 

13.2 

15.2 

20.3 

15.6 

(4) 

( 2 ) 

(3) 

(5) 

( 1 ) 

14.5 

10.5 

l 2,.0 

10 .9 

- 8.8 

( 1 ) 

- C3> 

_15) 

—111 _ 

_( 1 ) 


Analyse the data to find the best treatment (if ayiy) • 
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Show that for a symmetrical BIB design with v even, (r-v) 
mUS t be a perfect square. 

7 . Let N be the incident matrix of a BIB design with parameters 
v,b,r,k,*. 

( a ) Show that | NN' | = 0 if b ^ v . 

(b) Show that the characteristic roots of NN' are rk and r-X of^ 
multiplicaties 1 and v-1 respectively. 

(c) Show that if b = *v 
|NN'|= r 2 (r-X) V_1 . 

8. Prove that for a resolvable BIB design 
rank N < b-(r-l) . 


9. Given a set of v treatments, consider a BIB design with 

m 

parameters v , b = i$v(v-l), k = 2, r = v-1, X= 1 (blocks are 
formed by taking all possible different pairs of treatments) . 
Taking blocks for treatments and treatments for blocks in this 
design obtain a new design in which any two blocks have a single 
treatment as a common link. Show that the new design is a PBIB 
design. Obtain the parameters of this design. 


10. The followings are the plan and yield in lbs of 13 varieties 
of wheat (numbered 1 to 13) in an ejq>erimental trial conducted in 
the Agricultural Research Institute, Pusa, India. The figures 
inside the brackets indicate the variety number. 


Blocks 


Yield of wheat bv varieties. 


1 

2 

3 

4 


50.1 
(3) 

45.3 

(3) 

32.0 

( 10 ) 

54.1 

( 2 ) 

46.5 

(7) 


39.5 

( 6 ) 

38.9 

(4) 

38.2 

(ID 

54.5 

(5) 

60.9 

( 8 ) 


57.3 
(9) 

66.7 

(8) 

63.5 

( 12 ) 

70.7 

( 8 ) 

61.3 
(9) 


48.5 

( 11 ) 

45.3 

( 12 ) 

53.9 

(13) 

34.3 

(ID 

64.3 

( 10 ) 


5 
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J 



6 

60.7 

(4) 

65.0 

(5) 

54.2 

(6) 

65.1 

(10) 

7 

68.5 

(1) 

62.1 

(5) 

51.7 

(9) 

61.3 

(12) 

8 

68.5 

(3) 

65.1 

(5) 

66.1 

(7) 

72.9 

(13) 

9 

76.7 • 
(1) 

65.3 

(2) 

75.1 

(3) 

62.6 

(10) 

10 

57.1 

(2) 

60.9 

(4) 

53.5 

(9) 

70.3 

(1*3) 

11 

72.9 

(1) 

62.3 

(4) 

62.7 

(7) 

5-7.1 

(11) 

12 

63.9 

(1) 

67.3 

(6) 

55.3 

(8) 

84.3 

(13) 

13 

60.7 

(2) 

63.1 

(6) 

79.1 

(7) 

53.5 

(12) 


Identify the layout actually used in this experiment. Analyse 
the data to find the best variety if any. ' 


11- With a view to selecting the varieties of wheat giving highe, 
yields an experiment was conducted with 9 varieties of wheat in 
Haryana Agricultural University using a symmetrical PBIB design 
The layout of the design with yield per plot in kilogram is give, 
below. The figures inside the brackets indicate variety number. 

Yield in kilograms Blocks 


4 


52 

(3) 

53 

(9) 

48 

(5) 

54 

(1) 

59 

(8) 

62 

(6) 

45 

(2) 

46 

(9) 

47 

(6) 

31 

(1) 

28 

(2) 

25 

(3) 


Find the parameters of thf* ■ 

the design. Analyse the data. 

control treatment landed S “ PP ° Se 

ls now increased to k+i u ° ck S ° that the block size 

de sign? ° w do you plan to analyse such a 
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Source,: Biometrics Vol.ll, 1955 , page 112> edited by 

.W.Snedecor. 

An experiment was conducted in a Latin square layout to 

study the responses of five pairs Df monkeys to a certain kind 

of stimulus under five different conditions: A,B,C,D,E during 

five periods consisting of successive weeks. The figures inside 

the brackets represent the total number of responses. 

_ ~~ ' ■ - 

Total number of responses 

Animal Period 

1 2 3 4 5 

1 B(194) D(369) C(344) A(380) E(693) 

2 D(202) B(142) A(200) E(356) C(473) 

3 C(335) A(301) E(439) B(338) D(528) 

4 E(515) C(590) B(552) D(677) A(546) 

_J_ A(184) E(421) D(355) C(284) B(366) 

(a) Analyse the data under normal theory model. 

(b) Analyse the data assuming that the value in period 2 with 
animal 2, and the value in period 4 with animal 4 are missing. 

14. Source : Introduction to Engineering Statistics by 
I. Guttman, the late S.S.Wilks and J.S.Hunter, John Wiley(1971) 

The following data were recorded in a radioactive counting 
rate experiment of 4 specimens: S^S^S^S^ of radium. These 
specimens were subjected to a counter with 4 shielding methods: 

Y M 2' M 3' M 4 in 4 different orders: Rj.R^Rj.R,, so that the 
entire experiment layout is a Latin square design. 


Shielding Method 


3rder 


S 1 (26.46) 
S 2 (27.58) 
^3(29.54) 
S 4 (29.51) 


S 3 (29.61) 
S 4 (29.52) 
S (27.00) 

i 

S 2 (28.03) 


S 2 (27.82) 
S (26.48) 
S 4 (29.31) 
S 3 (29.53) 


S 4 (29.15 
S 3 (2S.13 
S 2 (27.90 
S 1 (26.51 


arry out analysis of variance. Use S- and T-Methods with 
*°5 to compare pairwise the different shielding methods. 
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« • 

15 source : Introduction to Engineering Statistics, by 

I.Guttman, the late S.S. Wilks and J.S. Hunter, John Wiley. ^ 

Microminature integrated circuits are manufactured, in part., by 
decoihpositing thin films of dialectric material in predesi gne(i 
patterns. Prior to mounting the film, the subtrate has to be 
prepared. In a study to determine the best operating condition 
fpr preparation 4 factors: A,B,C,D, each at 2 levels are studied 
The experiment was blocked into 2 blocks of 8 runs each and each 
block corresponding to a day. The design and the deposited sio 
film (y) are given below. 

A = Subtrate temperature, levels, A q = 250°C, A^ = 300°C, 

B = Vacuum in chamber 1, mm of Hg, 

_5 

levels, B = 1 x 10 , 

° -5 

B = 1.5 x 10 . 

IB -= Vacuum in chamber 2, mm of Hg, 

levels, C q = l x 10~ 5 , = 1.5 x 10 -5 . 

D = Pattern types, levels, D q = type 1, D 1 = type 2. 

_ --- / — " ' -- 

Mack 1 -Block 2 

Treatment _y^ Treatment y 

A o B o C o D o 3 ' 43 A o B o C o D l 3.62 

A l B o C o D l 4 ' 04 A l B o C o D o 4 -' 17 

A o B l C o D l 3 - 57 A o B l C o D o 3.48 

A l B l C o D o 3 - 8 ® A l B l C o D l 4.18 

A o B o C l D l 4 -09 A o B o C {° 0 3-27 

A l B o C l D o 3 - 27 A l B o C l D l 4.35 

WxV 3 -15 A oBl C lDl 4.09 

4 - 20 A.B.C n , c. 


fu? “* test the hy P° the sis that there is no 
difference of block means. 

:: 6 rr tion ° f a baian ° ed bi ° ck desi9n 

' 1961, Technometrics 
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The data were collected from a dishwashing experiment where 


the 


number 

foam 


treatments were detergents and the response observed was the 
of plates washed under standard test conditions before the 
disappeared. In the’experiment the detergent solutions are 
m ade up and plates soi led with standard soil were washed" one at a 
ime until they are clean. The test procedures calls for 3 
basins to be used for 3 treatments to be tested simultaneously 
an d the operators wash at a common speed during a test. A block 
is thus a set of 3 operators testing 3 detergent simultaneously. 
The treatments are denoted by A,..., I and the responses are given 


within brackets. 


Blocks Bloc ks. 


1 

A (19) 

B ( 17 ) 

c(ll) 

7 

A (20) 

E (26) 

I (31) 

2 

D (6) 

E (26) 

F ( 23) 

8 

B (16) 

F (23) 

G (21) 

3 

G (21) 

H (19 ) 

I (28) 

9 

C (13) 

D (7) 

H (20) 

4 

A (20) 

D (7) 

G (20) 

10 

A (20) 

F (24) 

H (19) 

5 

B (17) 

E (26) 

H (19) 

11 

B (17) 

D (6) 

I (29) 

6 

C (15) 

F (23) 

1(31) 

12 

C (14) 

E (24) 

G (21) 


Analyse the data and interpret the results. 
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CHAPTER 6. ANALYSIS OF VARIANCE II* FACTORIAL EXPERIMENTS 


This chapter deals mainly with factorial experiments. i n 
Chapter 5 we have discussed analysis of complete and incompi et 
block designs where the treatments are not necessarily relate^ 
to each other. In factorial experiments several factors effect 
simultaneously the characteristic under study and the experi ment 
is interested in the main effects and interactions between 
different factors. We shall consider briefly the technique of 
confounding to get incomplete blocks of smaller sizes when the 
number of treatments is large. We shall then consider the nested 
classifications, (Crossed and nested classifications, split-plot 
designs and Lattice designs which are some standard incomplete 
factorial designs commonly used in actual experimentations. The 
presentation in this chapter is far from complete and our main 
aim will be to give the analysis of variance of some standard 
incomplete factorial design. The random-effect^model and the 
mixed-effect model will be introduced as they are very 
appropriate to analyze the nested classifications, the crossed 
and nested classifications and the split-plot designs. 

We refer to Kempthorne (1951), Federer (1959) and Raghava Rao 
(1971) for design part of the factorial experiments including 
both the complete and incomplete ones. 



6.1. 2 -FACTORIAL EXPERIMENTS 

A group of treatments, which contain two or more levels of two or 
more factors rn all possible combinations is known as a 
factorial arrangement. For two factors A, B, A with two levels 

“?'“ 1 B With tWO levels VV the four treatment combina¬ 
tions are a^, a^. a.b, . 

b . eff6Ct ° f the factor * «hen the factor B is at the level 

b ° 1S a l b 0' a 0 b 0 ) ' and the same when the factor B is at the level 

A*? * a i/ 1 w° bl) ’ ^ aV6ragS (bf these two effects) Ijfa.b,- 

01 1 * (a l b O- a oV is called the main effect of A and is 

sh formal Its ■=> - 


written formally as the expression 
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(AB) = * (a r a Q ) (b^) (c^+Cq) 

Similarly the other two 2-factor interactions are 

(AC)= ^(a r a 0 ) (b^bQ) ( Cl -c 0 ), (BC) = Ma 1 +a () ) (b r b Q ) (c rC 

If we evaluate the interaction (AB) for each level of c we g et 
Js(a 1 b 1 c: 1 +a 0 b 0 c 1 -a 1 b 0 c 1 -a () b 1 c 1 ) when the factor C is at the i 6Vel 
c x and ^(a 1 b 1 c 0 +a 0 b () c 0 -a ;1 b 0 c 0 -a 0 b 1 c 0 ) when the factor c i s at x 
Obviously the average of these two quantities is the interaction 
(AB). If these two quantities are different ‘the average 
difference, written symbolically as 

(ABC) = ^(a 1 -a Q ) (b^-bQ) (c^-Cq) 

is called the 3-factor interaction (ABC). Writing 

y= -g ( a o b o c o +a i b o c o +a o b i c o +a i b i c o +a o b o c i +a i b o c i +a o b i c i +a i b i c i) 

the coefficient of treatments in the linear combinations 
representing different effects and y are given in Table 6.1. 

Table 6.1. Coefficients of treatments in different effects 



Effects 


Treatments 


*o b o c o a l b 0 C 0 a 0 b l c 0 a l b l c Q a o b o c r a i b o c i a 0 b l C l a l b l c l 


Note. The term "effect" will be used to indicate main effect or 
interaction of any order. From Table 6.1, it is easy tc see that 

linear combinations representing various effects are mutually 
orthogonal. 

,n. ,°t Ven n faotors each at two levels there are n main effects, 
2 ° r interac tions, ( ) 3-factor interactions and so, on. 










, ■«*. 
-<> 'r». ' 
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4 -otal number of effects is 2 n -l, which is also the 

e the 

e* 1 ^ degrees of freedoms associated with treatment sum of 

uitib er 

lC juares- 

development of factorial experiments is mainly due to 
^neering works of Yates and the reader is referred to Yates 
-*ie pi ° t he treatment of this topic and relevant original 
(1937) 10 

^eferenoes. 

general factorial experiments 

;;I' the treatments consist - - 

"•-££££££ 1 15 at 
* ty r el x i = ..»• may be written as 

the level x^ 

® £ Xi or t (x 1# • • • '*t) 

.5, i 

_ C. 4-^03 hments 


-1 

treatments 

A linear function of tr 


) x (x,,•••' x n^ different values o 

Ec(x, , - • *' x n 1 , x ), for au _ 

ailed a contrast if the ® *1' ~^ Q and S d*i.“ 

iS 0311 x , are not all e,ual to - ^ q£ .x„. 

K, "•*' n m taken, over . rtnr 


, 1e d a conned- - and L 

iS 0311 x , are not all e,ual to - ^ q£ ..x„. 

.. x n ' a.-on is taken, over f tor 

I A treatment c°_ ^ tYl e factors H 

interaction ** afferent values of .. . 

\ for all 

to a11 ct *v;-;; » 

depends only ° ij , = ! . 1 k‘ 

/ ,X n ) s °' 3 1 ret <t>- de * 

Ui> * •‘*1- called a main 1 

*1 .action is caU ® that is. 

* °* the factor 
j ^ ^1 + rtf ' 1 * . . n gt belongs t0 ^ K ' £ 

= f i nt contrast 

t -i treatment 

Then a typi cai ) is 

I interaction ^i, v 


4 § 
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• * • 4> n 

d>. •••<!> 


I l (X 


1 1 

. / .. • / x. ) f. ... f ^ 

X 1 x k 1 1 


k 

where the £(x. , ...,x ) f or different values of x ± 

X 1 x k 1 e 

not all zero and 

E a (x ,x )= 0, j =i^ / • • • / i k - • (6.10) 

Xj 1 k 

One obvious way of obtaining these contrasts is to choose 


(x. /. •«/ x, ) — n 
*1 1 k j=i 


W 


where e £, (x. )=0, j = i.., 
x. J J 
1 

. there are 


• ;i k* 


( 6 . 11 ) 


From (6.11) it follows that 


4 V 

II (s.-l) independent contrasts 
j=i l 

4>1 • • -<£ n lk X, 

■ -V Z £ . (x .) f , 

i=i j. j j 

X 1 x k . J X 1 

all belonging to the k-factor interaction (F. ...F. ). Further- 

X 1 1 k 

more any two different contrasts belonging to the same or 
different interactions and main effects are mutually orthogonal. 

Thus there are (s.-l), i = i- n , set s of independent 

(linearly) treatment contrasts all belonging to the .main effects 
and (s^ 1 ) (Sj-l) , i ^ j, i,j = l,...,n sets of independent 

contrasts all belonging to the 2 -facxor interactions . 

n 

( 8 -i‘ 1) lndependent contrasts all belonging to the n-factor 
interaction. Since 


Ji- <s i~ 1,+ J I(s i-i) <s j-D + ••• + n (s ± -i) 

i=l 

=C A ( 1 +s i- 1)] - i = n s -l , 

1 ~ 1 i=l 1 

w!th°oneT^ hat ^ t0tal n « mDer ° f inde P en <3ent contrasts, each 
one degree of freed™ £ s,-!. A factorial experiment 



'A* 


analyse of variance I, 30, 
(with n- factors)^ is called symmetrical if s 

th en called a s factorial experiment. Then < f '" Y n = “ a " d lB 
2 levels/, each main effect or interaction « 11 actors are of 

ene degree of freedom. ° f a " y «I» 

Example 6.1 . Consider two factors A and B, a with levels 
a 0 ' a l ,a 2 and B Wlth levels b 0 , bl ,b 2 . The treatment combinations 

are a 0 b 0 ,a 0 b 1 ,a 0 b 2 ,a 1 b 0 ,a 1 b 1 ,a 1 b 2 ,a 2 ,b 0 ,a 2 b 1 ,a 2 b 2 . The 
contrasts belonging to (A) (each with one degree of freedom) are 

( a 2" a 0* ^ b 0 +b l +b 2* ' * a 2” 2a l +a 0* ^ b 0 +b l +b 2 J * 

Similarly the contrasts belonging to (B) are 

( b 2” b 0^ ^ aQ+al"^" a 2^ ' ^ b 2” 2b l"^" b 0^ ^ a o"^" a l"^" a 2^ " 

The contrasts belonging to (AB) are 

^ a 2” a 0^ ^ b 2"” b 0^ ' ( a 2“ 2a i +a o^ ^ b 2” b 0 ) ' 

(a 2 -a Q ) (b 2 -2b 1 +b 0 ), (a 2 -2 ai +a 0 ) (b 2 -2b 1 +b () ) . 

If the factorial experiment can be carried out in a complete block 
design such as randomized block or latin square design with r 
replications per treatment the best estimate of any treatment 
contrast is obtained from the treatment contrast by -Pla-g 

«. W ««** 

from the experiment. Let T i i k 

, . - all treatment combinations where the factor Fj 

total yield of a The sum of squares due to 

is at the level *y 3 '/^tor F *is given by 

the main effect of the factor F, is g 


(F x ) = 


_ 

r s 2 “..s n 


( 6 . 12 ) 


" 1 of freedom where T - S* T l (x l ) 

Lth s r l degrees of freeo X 1 1 

ff6CtS of oth©^ 

nd similar expressions hold ^^interaction VfJ is 

actors. The sum of squares (6 .i3) 

^(FjFo) = E s r sv* ,s n 


i 

* 

! 
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and similar expression holds for any two-factor interaction 
It may be noted that SS(F 1 F 2 ) is the error sum of squares 0 f 
a two-way classification with factors F 1 ,F 2 and with a singly 


observation T 10 (x 1# x 0 ) per cell. Similarly the sum of 


12 '~ 1 '*' 2 ' 

squares due to interaction (F^F^j) is given by 


ss(f 1 f 2 f 3 ) = C 


E 

x. 


I 

x. 


123 


(x 1 ,x 2 ,x 3 ) - — 


1 * * 


n 


- ss(f 1 ) - ss(f 2 ) - ss(f 1 f 2 ) - ss(f 3 ) 

- ss (f x f 3 ) - ss (f 2 f 3 > . 


(6.14) 


The calculation of sum of squares of other interactions are now 
obvious. The analysis of variance table of a complete block 
factorial experiment is given by the analysis of variance table 
of a randomized block or latin square design depending on the 
design actually used to carry out the. experiment. The only 
difference in this case is to divide the total sum of squares 
into the sums of squares of different main effects and inter¬ 
actions. As an example we give the analysis of variance table 
of a 3-factor factorial experiment, carried out in a randomized 
block design with r replications per treatment (Table 6.2). The 
significance of each main effect and interaction is tested by 
F-test. For example, to test the significance of the main 


effect (F^) we use the statistic F = MSS(F 1 )/MSS (E) which has 


F-distribution (s^-1), (r-1) (s 3 s 2 s 3 -l) degrees of freedom under 

the hypothesis. Similarly the significance of the interaction 
<F 1 F 2 > is tested by the statistic F = MSS (FjFj) /MSS (E) which has 
F distribution with (Sj-1) (s.,-1) , (r-1) (s^Sj-l) degrees of 
freedom under the hypothesis. 


6,3. CONFOUNDING 

When a large number of treatment contoinations are to be 
experimentally tested, it is undesirable to use complete block 
designs because in that case plots in each block may not be 
homogeneous in their soil fertility, since this may result in 
arger experimental error variance it is desirable to keep the 




’Xt 
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6.2. ANOVA Table of a 3-factor factorial 
Tabl e * * eX p er inient carried out in randomized 

block designs _'_ ' _ 


Sum of Mean sum of 
squares squares 


Re plica- 
t ions 

Treatments s^s^-l 


Main 

effects 


(FJ 


Si - 1 


SS^) mss (f^) 


(F 2 ) 


(F 3 ) 


Interactions 


' F 1 F 2> 


;F 1 F 3> 


F 2 F 3> 


F 1 F 2 F 3> 



rs l s 2 S 3" 1 


MSS(F t ) 

mssTeT” 


MSS (F 2 ) 


3 2 -1 

ss(f 2 ) 

mss(f 2 ) 

mssTeT 

mss(f 3 > 

s 3 _1 

ss(f 3 ) 

MSS(F 3 ) 

MSS (E) 

MSS (F X F 2 ) 

(s-j-DU^l) 

ss(f 1 f 2 ) 

MSS (F’ 1 F 2 ^ 

MSS(E) 

mss(f 1 f 3 ) 

(s r l) (s 3 -D 

ss (f 1 f 3 ) 

mss(f 1 f 3 ) 

mssTeP 

MSS(F 2 F 3 ) 

(s 2 ~l) ( s 3“-^ 

SS(F 2 F 3 ) 

MSS (F 2 F 3 ) 

MSS(E) 

MSS (F 1 F 2 F 3 ) 

(s^- 1 ) 

( s 3"D 

ss (F x F 3 F 

3 ) mss(f 1 f 2 f 3 ) 

mss(eJ 

(r-1) (s 1 s 2 s 3 -D 

ss (e) 

MSS(E) 



as "confounding" the 

Lock size small. By a device • ^ ^ randomi2e a block 

ecessity of all treatments i ^ latin square design is 

esign or in each row and P™” ^gjyjjand the whole block or 

We-stepped (see Yates (1933, ’ number of incomplete 

;he replicate is divided into i confounding let us consider 

‘locks, to understand the concept of co 

following example* 
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Example 6.2 . Consider a factorial experiment with 3 f 
A, B and C, A with levels a Q/ a^B with levels b^b^ and c Wi1 ° rs 
levels Cq,c^. There are 8 treatment combinations, for a comp 
factorial randomized block design each block contains all the 
treatment combinations. Consider the interaction 


(ABC) = %( ai -a 0 ) (b 1 -b () ) (c 1 -c Q ) 

♦ 

- ^ (a^b^c^+a^b-QCQ+aQb^CQ+aQbQC^) 

- ! ‘ <a l b l° 0 +a i b 0 c l +a 0 b l c l +a 0 b 0 c 0 ) 

and suppose that we arrange 8 treatment combinations in the 
following 2 blocks (the figure \inside the: brackets denotes the 
treatment number). 


Block 1 


Block 2 


a l b l C l a i b o c o a o b o c o a 0 b 0 C l ' a l b l c O a l b 0 C l a 0 b i c i a o b oV 

The block-1 contains all treatment combinations with positive sign 
and the block -2 contains all treatment combinations with negative 
sign in the expression for (ABC)'. Lek x ± . denote the observation 
on the jth treatment in the ith block and let 

X ij = P + “i + t j + e ij' 1 = *' 2 , j = 1 / •/ 8 , 

where v is the grand mean, c. is the ith block effect satisfying 

“l + “2 - °- b j is the J th treatment effect satisfying I t. = 0 

and the e^ are observation errors. Let u, a 1 denote the • 

estimate of y, o^, t^ respectively. Then 


X 18 = w + “1 + t g 

A , A A 

^17 = M + + ty 

*16 = K + “i + i 6 

x 15 = y + o. x + t 5 

Consider 


X 24 = W + “ 2 + b 4 

X 23 = U + ° 2 + t 3 

^ /N A, 

X 22 = u + a 2 + t 2 

* * A, 

X 21 = u + a 2 + fc i- 


q "F X. — + x + v v 

= k<? , I 16 A. 15 r X 24 x - x 23 - x 22 - x 2 i) 

8 + + - t t 2 21 " 

- *(ABC) + !, Wl ! S 2 ) . 4 3 2 * b l> + >5 <«! - « 2 > 
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fr0 m this i t is obvious that it is not possible to estimate (ABC) 
£re e from block effects. In other words the interaction (ABC) and 
block differences are extricably mixed up or completely confounded 
tf ith each other * It is worth noting that if we have one complete 
block instead of two incomplete blocks then (ABC) can be estimated 
free from the block effects (here a 1 = a 2 ) . 

With the incomplete blocks, 1,2, the other main effects and 
interactions are not confounded with the block differences. To 
see this consider the main effect (A) 


(A) = !((>jb,c 1+ a 1 b 0 °l + a l b lV a l b 0 c 0 ~ a O b l c l' a 0 b 0 c l- a 0 b l c 0 - 

- a o b o c o ) ' 

Since 

J 4 (Xi8+x 2 3+ x 24 +x 17- x 22- x 15- x 16- x 21 ) 

A. A A A A. A . A A 

= ^(tg +t 3 +t 4 +t ? -t 2 -t 5 -t 6 -t 1 ), 

(A) is not confounded with the block differences. Similar 
conclusions hold for other effects except (ABC). 

In Example 6.2 we have considered only a single replication. 
However, we could have r replications by replicating those two 
incomplete blocks r times or by taking different pairs of 
incomplete blocks for different replications. To get a 
different pair of incomplete blocks of size 4 we obviously need 
to confound a main effect or a 2 -factor interaction with the 
block difference. By replicating the same pair of incomplete 
blocks r times we confound (ABC) completely with the block 
difference in the whole experiment. 

If a main effect or an interaction is of little or no 

importance, it is recommended to confound it with block 

differences and such an effect is called "completely confounded". 

If an effect is confounded with block differences in some 

implicates but is not confounded in the remaining replicates, 

it i s called partially confounded. 

To reduce further the block sites we need to confound two 

“more effects simultaneously. Suppose in the above examp ey 

decide to confound (ABC} 


^t incomplete blocks of size 2 and we 
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and (A). Assigning all treatment combinations having »+u 

s ign 

in both (ABC) and (A) in block 1, treatment combinations haw 

' av irig 

plus sign in (ABC) and minus sign in (A) in block 2, treatment 
combinations having minus sign in (ABC) and + sign in (a) i n 
block 3 and finally treat combinations having - sign in both 
and (A) in block 4. We obtain the:- following arrangement in 
blocks of size 2. 


Block 1 


Block 2 Block 3 Block 4 


a l b l c l a i b o c o 


a o b i c o a oVi a o b o c o a o b i c i 


a o b o°i a o b i c o 


In such confounding arrangement, in addition to (ABC) and (a) 
which we have confounded to get these blocks, the interaction 
(BC) is automatically confounded. The interaction (BC) is 
called the generalized effect of (ABC) and (A). In general the 
generalized effect of any two effects is a treatment contrast 
wherein the coefficient of a particular treatment combination 
is 1 if that treatment combination occurs with the same sign in 
both effects and is -1 otherwise. It is easily verified that the 
generalized effect of any two effects (X) and (Y) where X and Y 
contain different selections of the factors A,B,C, etc. is 
obtained by multiplying the factors in X,Y together and equating 
the square of any factor to unity. For example 

(A) (ABC) = (A 2 BC) = (BC), (AB) (BC) = (AB 2 C) = (AC). 


The general rule exemplified by this case is that if any two 
effects are completely confounded with block differences so is 
their generalized effect. The analysis and constructions of 
confounded, partially confounded factorial experiments are well 
treated in Kempthorne (1951), Federer (1955), Johns (1971) 
Raghava Rao (1971), Das (1964) and Das and Rao (1967). In the 
following sections we shall discuss the analysis of some 
standard incomplete factorial designs which are widely used in 
scientific experimentation. 


6.4. NESTED DESIGNS (FIXED EFFECT MODEL) 

Here we shall consider treatments which contain two or more 
levels of two or more factors and we are interested to find out 
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T-plationship of factors in an incomplete layout called 
th© , 

. *The levels of a factor B are said to be nested 

jesting- 

•thin the levels of a factor A or simply B is nested in A if 
level of the factor B occurs with only a single level of 
e factor A. We call two factors A, B crossed if neither is 
ested in the other and completely crossed if every level of 
one occurs with every level of the others, that is, they form 

a complete two-way layout. 

Consider a genetical experiment with I sires where each 
of the I sires is mated with J dams and n daughters are 
chosen from each of the IJ litters and we study the variabrlrty 
of the lactation yields of daughters. This is an ex '* m P 
nested design with two factors A (sires), B(dams) -d^is^ 
nested in A. bet x denote ^e lactation yie ^ random 

daughter of the jth dam mated with the ith^sire. ^ ^ 
effect or mixed effect models are fixed-effect 

designs let us first consider th ^ ^ ^ ^ 

model for nested designs. The 

experiment is 


. = y + a. + t. + e ijK 

ijk 1 


(6.18) 

n where V is the grand mean, 

• i- — « 4*V» 


= 1.J = . . J: * l '''t is the effect of the jth 

is the effect of the it > s _ ij ^ ^ effect of the 

am mated with the ith ^ ith sire. In the 

th daughter of the jth a obser vational errors which are 

ixed effect model the * h dist ributed with means 

issumed to be independently assume tha t «i - • 

md the same variance o • 

: t = 0 for each i- q£ dams mat ed with 

1 ot necessary that the num sire to sire . 

It is not n same ^ they may vary ^ may also 

different sires ^ daug hters on di ^ mated with ith 

Similarly the num Slaughters on th J sire be j., 

vary. Let the number o*j- « ^ ma ted with the 

sire be n.. and the number 
Write 






-ft. 
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■jr/v 1=u = y + cx. + t 

* tX ijk' ij M 1 J 

X = y. J +e.. 1 # (( 

•ilk M ij U* 

J . I 

n = Z ' n . / n - Z n., 

1 j=l J i=l 

k = 1,..., n. ., j = i = 1#•••/!• Suppose we impose t 


restriction that 

n. a. n. . t. . 

2 - 1 - ■ - - =o Z ^ = 0 for all i . 

n . n. 

i J 1 

Then with = y + a ± + t ± . we obtain 

n. . u . . n . . y . 

E = U+ <V Z Z = y 

j i i J 


( 6 . 20 ) 


( 6 . 21 ) 


The fixed effect model for the general 2-factor nested design 
is then 


X ijk = y + a. f tj + e. jk 


( 6 . 22 ) 


where y , a^, t^ are as defined in ( 6 . 21 ) subject to restric 
£ions ( 6 . 20 ) and the e_ k are independently normally 
distributed with means 0 and the same variance a 2 . With this 
model we are generally interested in testing the hypothesis 

H a : a11 a i = h t : a11 t ij = °* To obtain the err<?r sum of 
squares (to estimate cr ) we have to minimize 


s 2 = Z Z Z (x. . - y ..) 2 
i j k ^ 


(6.23) 


with respect to^y_. The least square estimates of the y. . 

are evidently y_ = x.^ = 2 x i:jk /n and the error surr^of 

squares is * 


? ? I (X ijk-> ] ij )2 -II . 1 < x i( v - > 2 . 


i j k 


i j k 


(6.24) 


and is easily seen to have Z Z(n..- 1 ) degrees of freedom. 

i j 

Under H a the minimum value of s 2 is 

? ? ?* X iik ' x ii > 2 + E n, 6 c 2 _ n x 2 ffi. 


i j k 


+ Z n 6 c _ nx‘ 

i * -L #» 


(6.25) 
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n <5 under H,p the minimum value of is 


l i £ (x ijk " X 
i J k 


ij- >2 + Z 1 n ii x ii - * ».x. 2 
J i j 1J • i J. i. 


(6.26) 


= S Z I 
i j k 


x. .. 
nk 


x. = i i 

j k n i 


Thus from (6.24) and (6.26) the appropriate invariant'test of 
Hrp (st level a) rejects H_ whenever 


d 3 q 2 

d 2 q 3 — F l-a' d 2' d 3 


(6.27) 


where d 2 ,d 3 ,q 2 and q 3 are as given in Table 6.3. Similarly 
from (6.24) and (6.25) with the additive model (6.18), one could 

use for testing H the invariant test which rejects H whenever 

OL a ■ 


d 3 q l 

(I-l)q 3 - F l-a' I_1/d 3 


(6.28) 


where q^ is as given in Table 6.3. However a more appropriate 
invariant test of H a , which is valid whether the assumption of 


additivity holds or not, rejects H whenever 

a 

(I-l)q 2 - F l-a' I_1 ' d 2 * (6 * 29) 

However, when all =0 (6.28) and (6.29) are the same. 

Table 6.3. ANOVA table of a two-factor completely nested 
classification with fixed effect model 


Sum of squares 







IF1S analysis of variance 

. the random effect model is more re ? u stlfc 

As for nested design ^ p statisti cs given in Table 

than the fixed e ^ ^ nQt got much practical intere % . 

£ ° r ^/^nlLr the analysis of a nested design with random- 
We shall consid discuss the general concept 

effec t model in Sectron ^ ^ seotion . 

of random-effect models m the 
6.5. RANDOM-EFFECT MODEL ' 

, classification with linear model 

Let us consider the one-way classing 

(see (3.94)). 

E(X ) = U + a., or x. • = U + a ± + e i j' j = 1 . N i' 1=1 . p ‘ 

(6.30) 

If we suppose that the a ± is a random sample of size p (instead 

of unknown constants as in the fixed-effect model.) from a normal 

population with mean 0 and variance c? A and the e ± . are 

independently normally distributed with mean 0 and variance o 

and are independent of the a^ then (6.30) becomes the random- 

effect model of the one-way classification. Since from (6.30) 

2 2 2 2 

Var(X. .)= o + a .a, and a are components of the; variance 
ij A A 

of X. . and are called variance components. In the fixed-effect 

ij 

model we write a i = ou. 

Example 6.3 . In an experiment to study the variation of 
lactation yield of daughters of different dams, a genetist might 
pick p dams at random from the population under study and then 
choose ^ daughters from the ith dam selected for the 
experiment, in such a case x\ will denote the lactation yield 
of jth daughter of the ith dam. We assume that this gives a mean 
lactation yield y, a 2 denote the dam-to-dam variation and a 2 
measures to variability of daughters' yield from the same dam. 

^ ple 6 ’ 4 -* Su PPOse that an experiment is tried in a 
actory with p workers, selected randomly from a large number of 

a larae W C produce small parts of some kind. Suppose also that 

any wor^^rr ^ ^ ^ th * and f ° r 

considerable day-to-day,variation. 
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SupP ose that t * ie worker is assigned for n i days during 

xperinvent. Denoting by x^ the output of the ith worker in 

i-h day that he uses the machine we can write 
the J tn 1 


tne j 

* = u + a i + e ij' j = 1 . N i' 1 = 1 . P ' 

e + a , j_s the true mean output of the ith worker, an 
Realized daily average for the ith worker after he has acquired 
L* Stability with the machine following a learning stage and 
^ is his error on the Jth day. The variability of outputs of 

U* J - « * ■ ■ T _ * J. yyy 


_ is ms ciiu*. — -- J , , , 

thi ith worker about the true mean V + <*i is measure y 
(assuming that they are the same for all workers xn the 

population). 


.ationj . 

Iiv tested in a random effect model of a 

The hypothesis usual y obviously H o is true if 

one-way classification is ^ ^ ^ ^ clasIif ication if 

and only if i it a balanced classification, 

all Nj. are equal, then with several factors is 

Generally a higher-way classi lon „ in rt e cells are 

called balanced if £actors is called balanced if 

all equal. A layout with is the same for every 

the number of levels of a . which it is . nested, the 

combination of all other Actors crossed: 

factors (if *«*> —-lemons is the same for every combine 
and if the num er experiment. 

tion of other factors in d ^ ^ 

wall now consider the analysis of a . n). 

We shall now _ mo ael- (N n - ••* P 


v, a n now’ consi— •— . _ = n = n) . 

We shall now t model- («,-••• P . 

Classification with «- S-ral case of unequal 

Wald U 940 ) solved this prob £or practical 

but his solutions are 

computations. maximum likelihood 

_£ Table ^ 

• „ the notation of 

Using the nu variance are gi 

. _ the components or 

estimate of th 


o 2 = Lp(n-l)] _1 q 2' 


(6.31) 

A 2 ^ a 2 =r (p-1) ^1 

/ n °A + ° ' mr>le tO fl nd 

. relatively simp 

■f ication it lS r in mo re complex 

In a one-way classify * 1 tes . However, m ^ maxitnu m 

these maximum likelihoo model the solutio 

experiments with random effec 


M2 
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likelihood equations are not so easily obtained. The 

Sf 

approach, to obtain tests and estimates in random-eff ect " 
in balanced cases, is to consider the mean squares i^ the 
corresponding analysis of variance table under the fix©^ 
model. It may be added that the statistics usually sugg'*^ 
with the random-effect models are not necessarily the onea^ 
with the fixed-effect models. U ® e d 


To explain this fully let us first find out the di st 


of 


ribu Mo„ 


0, = Z n (X. -X 
1 i ^ # 


) 


Q, = E E(X ij- X i.) 


i J 


( 6 . 32 ) 


for the balanced one-way classification with random-ef 

i£ec t model 

In the random-effect model (as aJ so we have observed in n\> 

i Ch apte r c 

in connection with the randomization model) p‘ E a - a 

i i ' a . cannot 

be taken to be equal to zero. Since the e_ are independently 

normally distributed with mean zero and variance o 2 

2 


_£ - r 
2 

a i 


E 

J- 


(e ii- e i.> 


(6.33) 


is distributed as the central chisquare with p(n-l) degrees of 
freedom. Now 


Q x -* E n (a.-a + e -e ) 2 = E n (a. 

i * i. .. i i 


+ e 


i. 


- a -e ) 


(6.34) 


From, the independence of the a t and e ± . and the fact that the a 


are independently normally distributed with mean 0 and variance 


3 A' Var ^ a i + e i J = . 

M i i. An 


2 2 

n of+o 

A 


= E 
i 


( a i ♦ e, -a -e 


) 


Thus 

2 


■ 


° 2 ♦ n** 1 o 2 


(6.35) 


is distributed as y 2 

A p-1* 


iously from our assumption in the random-effect model 


the (a i -a ) and the ( 


assumption of independence of the 


e ij“ e i.) a ^e independent. From the 


e 


Cov(e ij- e i. 


e i.- e 


) = E(( 


ij 


e iJ- e i. )e i. ) - Ete .. 


_ 








-1 


f(( e lj - e i. )e i- ) ' E(p ' (e ir e i. )e i.) = 
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0 


21 


\ n — 1 I 2 . 1 ^ 

)e. ) - 2 E(e i1 ) ” ~~2 1 

1 n J j'/j 


a,-a +e -: “ e 


n-1 2 n-1 

—2 0 - 2 = 0 • 

n n' 

,b»s T-. -l. .. * nd e lJ * e i. are independent and hence Q r Q 2 

* re independent in the random-effect model. From (6.33) and 
( 5 . 35 )» to tePt the h YP othe 3is we use the statistic 

r = (1 + n -|>F (6.36) 

(p-1) Q 2 <r 

where F has central F-distribution with p-1, n (p-1) degrees of 

no A 

freedom and under H ( 1 + —— ) F = F. Thus the level a test 

Q O ^ 

of H consists of rejecting H if 
a “ 


p(n-l)q. 


> F 


(p-1)q 2 - l-a,p-l,p(n-l)* 

2 2 

The power of this test depends on 0 = o^/a and is given by 


(6.37) 


B(9) = Pt (1 + n0) F > F lHE#p . 1#p(n . 1) > 


= P{F > (1 + n0) 


-1 


F l-a,p-1,p(n-1)* * 

and it involves only central F-distribution. 

No doubt the hypothesis H^jO^ = 0 is not a realistic one but 
it can be treated as the limiting case of a more realistic 
hypothesis H(0 o ):Of0Q, where 0 Q L 0 is a preassigned constant. 
The hypotheses H a , H(0 Q ) correspond respectively^ the 
fixed-effect modfl hypotheses l a* = 0, l a ± /o < 0 O and 

is e special case of H(0 Q ) when 0 Q = 0. The uniformly most 
Powerful invariant test of ^(0 Q ) (see Lehmann (1559), p.288) 

I rejects H(0 Q ) whenever 
Pln-1)q x 

I Tp-l) q 2 


> c 


(6.38) 






• y ■ » — \ 
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Where the constant C is determined by the condition that 
P((l ♦ P6)F » C ) < a and is equal to a if 0 = e o . ^ 

where a is the level of significance of the test precede 
Thus with 8 - 9 0 '6.39) reduces to 


391 


P((1 + n6 o* F - = 0 

and hence C = (1 + n0 o ) F l-a.P-1 ,P(n-D ’ The power o£ the t e , t 
in (6.38) is 

l+n6 0 , 

$(6) = P{F > 17^0“ F l-a,p-l,p(n-l) 

and it involves only central F-distribution. In the particui ar 
case 0 = 0 the test (6.37) is equivalent to the fixed-effect 
model F-test though, of course, they ere solutions of diff erent 
problems and have different power functions. 

The uniformly most powerful invariant test of H(0 Q ) : s > 3 

rejects H(0 Q ) whenever (1 + n0 Q )F < U + n6 o ) F l-a,p-l.p(n-l)‘ 
The tests of the hypotheses 0 <Q Q and 0 > 0 Q are not only 
uniformly most powerful invariant but also uniformly most power¬ 
ful unbiased and also the uniformly most powerful unbiased 
test exists for testing 0 = 0 n against two-sided alternatives 
0 / 0 Q (see Lehmann, 1959, p.288). Unbiased estimates 0 ,o A< cf 
2 2 

0 , 0 * are obtained from the equations 

q 2 = p(n-l) a 2 , q x = (p-1)<n o 2 + o 2 ). (6 - 401 

The analysis of two-way and higher-way classifications with 
random-effect models can be similarly treated and we refer 
Scheffe (1959) for details. 


6.6. NESTED DESIGNS WITH RANDOM-EFFECT MODEL 

f 

We shall consider now the analysis of balanced 2 -factor 


nested 


designs with random effect models. The extension to the 

^ to 

case can be easily carried put by induction with respec 
number of factors. The random-effect model of a two-f aC 


balanced nested design is 


ij* 


+ a i * fc ij + e ijk' 


(A 


41) 
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x, j = 1 . J ' k = 1 . . Where the V tn o t and 

,* ltm "' ar e independently normally distributed with means 0 and 

CijK 3 2 a 2 and a 2 respectively, with this model the 
ti* 6 ies <V T 
^an ce * ,„fprest are 
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the t. 


res i 

i 3 ° if interest are 


of; = 0 

T 


we have to 


u • i Om — u (6.42) 

" 2 * 0 .- H t t ' 

H 3 : , e in the case of one-way classification 

p distribution of (see Table 6.3 for notations) 

find£irS J(x -X ) 2 . 0 2 = E r n(x " ' 2 

s l .. z 


Ol 


i J 


C i j • 


-X, ) 


. I tz < x ijk - X ij- ) 

' . i V 


(6.43) 


°3 ' i j * 

41) and (6.43) 

pros < 6, 


, s I nJ( a i + fc i. + e i.. 

h < 


- t - e ) 


( 6 . 44 ) 


j * I £ n(t ij + ®ij • 


-t. - e 


i.. 


> 2 . 


“2 


i J 


,, * E 1 l (e ijk ' e iJ- > 

i j K 


JL - notation is use* to 


** to the subscript i»*o*£ « J fre edom. 

ed as * iS ^^^ endently normally distributee and 
i the t., and e. jk are r V '6.45) 

He. 


(e. ( . . e. . ) = 0, E<(e 1Jk ' e ij. )e i-- > 

ij* lj * • , ont iv distributed, 

conclude that Q 2 .Q 3 are independently 


(6.46) 


(t il + e ij.> ■ °T + ° , chisquare with Kd-D de ^ ee3 

(na 2 +o 2 )is distributed as cent and the 

er T JnrP the a,, tne ij „ _ nd that of 

freedom. Furthermore since * dence of 

ie Pendently distributed, the n hat 

0 3 follow from (6.45) and the 


ij * = 0 ' 

) - o, E(e.j 

la* 

= 0 . 


t ) t ) * 0# 
E((t ij ' i 


ij. - e i )e 


e ) e 
' e i.. 1 


= 0, 


l *' e ij. )e . 




'i 


v 

J 


i, 

w 


it:? 

s 

£ 




✓ 


« 
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Since 


. ) = 0 ? * r l 4 * t ' 1 ”' 1 * 2 


Var(a ± + fc i. + e i.. 


' . 2 ^ na 2 + a 2 )" 1 Qi is distributed as 

we conclude that (nJa A * T 

_ -i _ ml* . ._ 


me —- ri 

, , T i ric>rrrees of freedom. Thus the 

central chisquare with 1-1 degrees 

+. «i rpiects H' whenever 

appropriate level a test of H T rejects T 


IJ (n-1) q 


xu \'in 

I (J-1) q’J” - F l-ot/I (J-D 


(6.47) 


and the level a test of H & rejects whenever 


I(J-l)q 1 

a': - F l-a,I-l,I(J-l) • 


(6.48) 


( 1 - 1 ) q 2 - 1 -a, 1 - 1,1 (J-i; 

As in the case of random-effect model of the one-way classifica¬ 
tion the power of each of these tests can be expressed in terms 
of- central F-distribution. 

The hypothesis H a and are not realistic in actual 
practice but can be expressed as the limiting case where e Q 0 
of the following realistic hypotheses 


l* : n — 0 < 6 , 

& o 2 + n o 2 " 0 


.* °T 


T ' a 2 


i e o • 


(6.49) 


The uniformly most powerful invariant unbiassed level a test of 
(see Lehmann(1959) p.291) whenever 


I(J-U q. 


11315 q^ - ( 1 +n 0 O )F l-a / I-l / I (J-l) 


(6.50 


and the uniformly most powerful invariant unbiased level a test 


of H* rejects H* whenever 

B a 


IJ(n-l)q. 


I (J-l)" q 3 - (1+nJ V F l-a,I(J-l) ,IJ(n-l) 


(6.51 


The unbiased estimates a 2 , a 2 a 2 of the variance components 
2 2 2 a l 

°A # a T* a are obtained by solving 


q 3 = IJ(n-l) o*, q 2 = I (J-l) (n S 2 + a 2 ) 


q l - (1-1) (nJ a 2 + n a 2 + a 2 ' 


writte 


where 








(6.48) 

sifica- 

terms 


e 0 •> o 


(6.49) 
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7 CRO SSED and nested designs (MIXED effect model) 

6 »'' 

nS ide r an experiment with three factors A,B,C,A with I levels, 

•ith J levels, C with K levels, in which the factor B is nested 
0 w 1 *- 

viin A and the factor C is crossed with factors A and B. Asjan 
ara ple consider a chemical process in which a feed stock is 
e treated in one of I different ways to form an intermediate and 
^ the intermediate is processed in the next stage at one of K 
Afferent temperatures. Suppose that we prepare J batches by 
^ . of i pretreatment method and from each batch we t^ke Kn 
^ les -n of'these are treated at each of K temperatures. In 
^fixed-effect model the observation x 1Jkl , on the l-th 
! *" ft 1 n) which was treated at the kth temperature 

; I l ..K) , taken from the Jth batch 1 ^ 

prepared by the ith pretreatment metho, < ' •’ 

| written as (6.52) 


mm 


with 


^ijk + e iJ kjl 


(6.53) 


A . pjB + of + +-a 3k(i) 

= V + “i + “j (i) ^ lk 

A . the ^in effect of the ith 
ja o£? is the leVe 


3 = u = * ' i j V-*-/ 

Where u is the -an. i ^ 

r el ° £tbe B or the 'ith 3 level of A-°k 
the factor B for c . since 

the Kth Xevel of 


ls the rarn j leVel o£ 

is the mam effect 

* . a? is the 


:< = E< X i.. 
1 


0L C = E (X 




) ' » 

i 

- w) < 


= * (Xy . . 


' X i 


C . o. There are two 

.A = a B „t = "• non-additi vlt T 


K • * = a (i ) ’ * he non-aaux-* 

k can add restriction. represents^ ^ that changing 

interaction terms < x > * the factor C - # greate r change 

between the factor ^ Qthe r 9 "* # parti cular »eth 

from one pretreated ^ ^ having 

in response to te * by ° thetS) ^evel of *• 

than those pf etre , on fo r 1 b 
i similar interpret 1 
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= E(X, k 

ik 1 * K * 


- P - a i 


conclude that 


AC _ 


AC _ 


= 0 . 


a°) = 


- X - X + x 

J. • • • • • 9 


Similarly 


a .k(i) 


= a i. (i) 


= 0 . 


between the .factors A and B. Since 


— - - X-°'1 -n.er talK about the TOin 

factor B is nested in a we 

** . from method to method. Two observa- 

effect of batches changing from memo 

4 -nm^rai-ure on two different pretreatment 
tions made at the same temperat 

»■»_ ■ 4-ha-t- thev are made at the same 

stocks have in common one thing that tney 

temperature. The first batch made by a particular method and 
the first batch made by another different method do not have a 
batch factor in common, as there are just b batches made by each 
method and.being a particular one or the other is just a label. 
Since the interaction (ABC) corresponds to the interaction AB 
varying from temperature to temper at ure, there is no interaction 
(ABC). 

We can analyse *this design by treating it as a two-way 
classification with batches and temperature/ the batches being 
at IJ levels and temperature at K levels and there are n 
observations per cell. The analysis of variance table for this 
classification is given in Table 6.4, 

We did not write down specifically the null hypotheses 
involved in obtaining the different sum of squares in Table 6.4. 
The reader will have no idifficulty to figure them out if needed. 
For example the null hypothesis to obtain SS (c) is all a£ = 0 , 

The SS(B) has been divided into two component SS (A) , 

) representing the Siam of squares between methods; the sum 
Of squares between batches within methods. Similarly SS (BC) is 

t tW ° C ° mPOnfintS SS (AC> and SS(BC|A), where SS (BC) is 
iqnoringtatr^ int6raCtion «»t we would obtain by 

methods (1 levIi s rand°t Blderin9 3 tW °‘” ay Ossification with 
per bell andssinclM em Peratures (K levels) and Jn observations 

W is the sum of squares for- interaction 




analysis of variances: 

wee n temperature and batches made by the same method 
thin batches). 


9 




Table 6.4. Analysis of variance of the three-factors 
crossed and nested design 


Sources of 

variations 

d.f. 

Sum of squares 

Ejected mean sun 
of squares (under 
mixed effect model) 

Me thods (A) 

1-1 

SS (A)= E JKnx 2 -C.F. 

1 • • • 
i 

° 2+nKo B(A) +JKno A 

Batches 

(within 

methods) 

I.(J-l) 

SS (B |A) =SS (B) -SS (A) 

° 2+Kno B(A) 

Batches (B) 

IJ-1 

SS(A)= E E Knx.. —C.F. 

i j 1J * * 

— 

Temper a- 
tures(C) 

K- 1 

SS(C)= l IJnx 2 v -C.F. 
k 

U 2+ u°BC(A) +IJn °i 

Inter¬ 
action (AC) 

(1-1) (J- 

1) SS (AC) = E £ Jnx 2 k ^- c * F 
i k 

1 

• o2+n0 L(A)' +IJn °C 

Inter- 

action (BC) I(J-l) (^ 

within 

methods 

Inter- (tj _i) (K 

action (BC) 

1) SS(BC|A)=SS(BC)-SS(AC) _ 

_1)SS(BC)=I l l n x i j k." C - F- 

i ] i 

Error 

IJK (n- 1) 

SS (E)=s 2 -SS (B) -SS (c) 

-SS (BC) 

£ 

a 


Total 


IJKn -1 


s 2 = i i l l x ijkr c,F 
i j * ft _- 


c.f. = 


jkft 


n.tie to assume that 

actual problems it 1* ^^distributed with means 0 

ere independency n° BC „« independent ; 

1 .»»ri—»iwsi d -. >*» 

» V" “ e ljk e « 

•J,11 ’ tM ><“ 
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assumptions in model (6.53) reduces it to one what is called th 
mixed-effect model . Under these assumptions the expected value s 
of different mean sum of squares are given in Table 6.4. We 
refer to Scheffe (1959), Section 8.2 for details. The notations 
used here are 


of = (I-l)' 1 Z (a A ) 2 , 
A i 1 


°b (a) --dtr-ur 1 z Ua B Ui) ) 2 , 


a 2 = (K-1) 1 Z (a^) 2 , o^, (A) = [ I ( J- 1) (K-1) ] " 1 £ Z Z(a“ #4l ) 2 . 


i j k 


jk(i) 


6.8. SPLIT-PLOT DESIGNS 

In agricultural experimentation sometimes it is necessary to 
divide a piece of land into several strips and then apply to each 
of these strips a different treatment. For convenience we shall 

i 

call these strips as main plots and the treatments applied to 
them as main treatments or levels of the main factor. The main 
plots are subdivided into subplots and one subtreatment is applied 
to each subplot. In a split plot designs the main effect of the 
main factor are completely confounded with main plot differences an 
it is a special case of the crossed and nested classification. 

Example 6.5 . The following experiment was conducted in 
Sagar-Bishnupur, West Bengal, India, to study the combined effect 
on rice crop of nitrogeneous fertilizers at four different levels 
(taken from two different sources each at two levels) (source 
I containing 30 lb. of nitrogen per acre), N 2 (source 1 containing 
60 lbs. of nitrogen per acre), (source 2 containing 30 lbs. 
of nitrogen per acre), (source 2 containing 60 lbs. of 

nitrogen per acre); and phosphorus and potash mixtvires at 4 
different levels 

P x s 0 lb of phosphorus in the form P 2 0^ an< ^ ^ lb P otas ^ 
the form K 2 0 per acre, 

& 2 S -0 lb of phosphorus in the form P 2 0g an< ^ 90 lb °f potash i n 

the form of K 2 0 per acre. 
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0 lb of phosphorus in the form P 2 0 5 and 30 lb of potash 
3 in the form K 2 0 per acre/ 

P . 30 lb of phosphorus in the form P 2 0^ 30 lb of potash 

in the form K 2 0 per acre. " 

rhe layout adopted was a split plot design with 4 replications. 
The actual layout and yield of paddy in kilogram per plot is 
given in Table 6.5. 

Table 6.5. Layout and yield of rice c rop 

R eplication 1 Replication ^ 

treatment Suhtreatroents Main treatmen t SufeiieatESBtS. 
-- ' D P. P 1 V 0 


*2 

X 2 6.3 


1 -3 

4.8 5.0 

5.1 

N 4 

5.6 

p 4 P 3 

4.2 4.2 

P 1 

4.9 

N 1 

P 2 
5.2 

P 1 P 4 
5.2 4.6 

P 3 
5.3 

N 3 

P 1 

5.3 

P P -X 
v 2 3 

4.1 4.9 

P 1 

5.5 

n 2 

P 4 

5.8 


P 1 v 3 2 

5.0 5.4 4.6 

P 1 P 3 

5.1 4.3 5.6 

P 2 P 4 P 3 
5.0 5.1 4.8 

p 2 P 3 P 1 
4.5 4.6 5.1 


pgnlication _-3 

sjain^^ Subt.e.t^eBts 

P 2 Pl 6 ? 2 

» 4 sU 4 - 8 5,2 


p^plicationj. 

. a 4- nents Subtre^tments 
M*in tre at menus- 

P P 3 P 1 P2 

s 2 5. 4 3 

P. P 3 P 1 
P 4 3 _ - 

2 a 5 3.8 ^. 1 






9 


6.65 





















Table 6.6 (contrt.) 
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> p lots 

P fertil izers 

interaction (NP) 

5 ub plot error 


11.42 


0.18 


3.65 


9.81 


0.22 


1.48 


. 9 # LATTICE DESIGNS 

Lattice designs were first introduced by Yates .(1937a) and have 
been investigated extensively by Kempthorne, Federer and their 
associates. We refer to Kempthorne ( 1952 ) and Federer (1955) for 

an exhaustive account of these designs. In general la 

i 4 -~ block designs where treatments are notre.ated 
designs are incomplete block d g But in 

. . wav as are a factorial set of treatments. But in 

"/conies they can also be treated as Incite f-oriai 

experiments . „ aocia te PBIB designs with Lj 

The lattice designs tW ° * the number of treatments 

association scheme. » ^ a „ ange the „ 2 treatments 

.V is a perfect square. V llca te two treatments are 

in an n X n array, m the in the jt h column of the 

in the jth block if and only ^ £orm £he weeks. For a 

I in the second replicate superimposed on the 

I array. I n tne snUare of order n is SU P 

I . .„, te a latin square Vhiock (of the 3rd 

third replltate^^ m ^ ts ^ , n the same bio ^ ^ £etter of 

J array and two correspon afferent 

I , . if and only n ncations we use ai 

I replicate) it ^ £urther replicat 

I the latin square. s of °r der n ’ ^responding to 

I orthogonal latin first one rows 

I . with two replicbtes. corre sponaing to * 

| a design wi tn the sec ond one £ de sign. * hus 

I the columns as bl £ ^ cal led a si ^ * pBlB aesign with 
I =f the array as ^ a tw0 associ ^ „ lt h b 3 • 

I simple lattice <*e ^ associate P 

* Association schenve. 












heme is called a trrple coluItins of the array form the blocks 0 
s three replicate^, t e ^ the array form the blocks of 
e first replicate, the q£ the sup erimposed latin s qUai 

cond replicate and t e licate . Simple lattice design j 

rm the blocks of the thxrd rep ^ r = 2 , k = n , 

PBIB design with parameters v 
= 1 / X 2 = °' 

1 oni 2 , 2 \ _ / 2 ,2 (n— 2 1 

. , P 1 ) . ( P 2 = (Pij) - <2(n-2),(n-l)2 + 


lattice design. A triple lattice <a esi 


= ( ii } ~ ( n- 1 , (n- 1 ) (n- 2 ) 


- v,2 


triple lattice design is a PBIB design with parameters v - n 

= 3n, r = 3, k = n, = 1/ X 2 = 0# 

n # 2 (n- 2 ) 2 . 6 ' 3(n_3) * 

_/ ),P =( 2 ' 

2 (n- 2 ) , (n-2) (n-3) ' 3^n-3) ,(n-3) +1 


Exami 


,le 6.6. Let n = 3. Consider the array 


1, 2, 3 

4, 5, 6 

7, 8 , 9 . 

5 blocks of the first replicate are (1,4,7), (2,5,8),. (3,6,9 

» blocks of the second replicate are (1,2,3), (4,5,6), (7,8, 

;se two replicates form a simple lattice design. Superimpos 
; latin square 

а, 6, Y 

б , y, a 

Y# a, 8. 

the array. Th^locks of the third replicate of the triple 
tice design, are (1,6,8) , (2,4,9), (3,5,7). 

If the basic I> ± lattice design (i.e. lattice design with 
ociation scheme) is replicated s times then we get an 
omplete block design with parameters v= n 2 , b = nsi, r = s 
X s, *2 ~ 0 w hich can b,e easily shown to be a two 

9 6 PBIB des1 ^ with Ll association sch e m 6 . 







bl °cJc s 
° f <=h s 

r squa ^ 
desi ® is 


a cubic (three dimensional) lattice design v = n 3 

ntS "* k arra " ged lnt ° 3 -Plicates of »3 treatnents in 
ate bloo*. of Site Ihe construction and analysis of 
esitrns are fully discussed by Federer (1955) (see also 
19 39J / • 


It can be established that a cubic lattice design with 


v* ^ iS a th ree-associate PBIB design with parameters 
u:s n , b = 3n , r = 3, k = n, n x = 3(n-l), n = 3(n-l) 2 

\ _ r\ -v ^ 



= (n-1) , = 1 , X = o, A = o. 


P* = 


n- 2 , 2 (n- 1) ,0 

2 (n-1, 2 (n- 1) (n-2) , (n-1) 2 


, (n-1) 


, 2 (n-2) 


, (n-1) 41 (n-2) 


, (n-1) 


2 (n-2) , 2 (n-1) + (n-2) % 2(n-l)(ri-2) 
(n-1) , 2(n-l) (n-2) » (n-1) (n-2) : 


0 , 3 , 3 (n- 2) 

3 , 6 (n-2) , 3 (n-2) ' 

3 (n-2), 3 (n-2) 2 , (n-2) 3 


6.9.1. Rectangular Lattice Designs ; The rectangular lattice 
designs were first proposed by Yates ( 19 37a, 19 37b) . In these, 
designs the number of treatments is of the form v = sn and has 
incomplete blocks of size s in one replicate and has incomplete 
blocks of size n in the second replicate. The analysis and 
discussion of rectangular designs for v = p(p-l) treatments in 
incomplete blocks of size p-1 in all replications are given by 
iarshbarger (1947,1949,1951). A rectangular lattice design 
rith v = p(p-l), where p(p-l) symbols are identified by a pair 
(i/j), i/j , i, j = 1, ...,p and all symbols having the same value 
:or i form the i-th block of the first replicate and all symbols 
aving the same value for j form the j-th block of the second 
©plicate, is called a simple rectangular lattice design. Nair 
1951) has shown that such simple rectangular lattice designs o 
four-associate PBIB designs with parameters v - P P- 
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, , „ _ 9 A = 1, A„ = u, ' o 

k =t p-1, r - '2 J 


= 1 , 1 , = 0 , X, = 0 , X 4 = o, ^ = 2(p-2), 


n 2 = (p-2)(p-3), n 3 = 2(p-2), n 


pl = 


p-3, p-3 ,1*0 

p-3, (p-3)(p-4), p-3, 0 


1 , P-3 

0 , 0 


, P-3, 1 

, 1,0 


= 1 , 


, P 2 = 


2 , 2(p-4) ,2 , o 

2(p-4),(p-4) (p—5), 2 (p_4) f x 
2 , 2 (p-4) ,2 , o 

0,1 ,0 , o 


P 3 = 


1 , P-3 


, P-3, 1 


>-3, (p-3) (p-4), p-3, 1 


p-3, p-3 
1,0 


, 1,0 

, 0 , 0 


, 2 (p-2) , (■ 


, (p-2) (p-3) ,0 


2 (p-2) , 0 


In a triple rectangular lattice design v = p(p_l) and each 
treatment is identified by ordered triplets ( 1 , j ,k) , i * j £ 

*' 1 * ' 1 "‘" P SUCh that 311 treatment s having the same i value 
form the rth block of the first replicate, all treatments having 
the same j value form the jth block of the second reni ■ <- 
all treatments having the same kth value form the kth bT t ^ 
third replicate. N air (1951) has shown that a triple latt ” 
design is a two associate PBIB designs for p - 3 *1 

associate PBIB design for p = 4 . For p „ three ' 

designs. p — 3 tney are not PBIB 

The analysis with and without- 

information of lattice de s d 1 gns h d °l/ eCOVery ° f ^ter-block 
of triple rectangular lattice aJ ^ * b ° V * (,xo n* the «se 
-V - the PBIB designs. deS19nS > ^ ba Seated in the sam 

For fc ^e analysis 

of designs discussed in ‘'° Variance < “issing val 

thteresting desion 1S ° hapter and for som efficien 

T _. y ues igns we refpr *- tor som e other r^i 

Joh »s (1971) and Rar * ' t0 Ped «er (i 955) „ hSr relevant 

d Raghava Rao (19 355 >. Kempthorne (1951) 
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f wifc n a View to Study the effect of application of nitrogen, 

| gphorus and potash independently and in combination, a 

V torial eXperiment With two levels of nitrogen in the form of 
ionium sulphate, two levels of phosphorus in the form of 

5 uperP hosphate and tW ° levels of potash was conducted in a 
ndomized block layout with four replications on paddy crop in 
ire nali' Andhra Pradesh, India. Details of the experiment are 
given below. 

Two levels of nitrogen: n Q =0, n L = 44.8 3 kilogram per acre. 
Two levels of phosphorus:' = 0, p^ = 67.25 kilogram per acre 
Two levels of potash: k Q = 0, k.^ = 67.25 kilogram per acre. 
Plot size 10.4 x 9.0 square meter. 

Yield of paddy in kilogram per plot: 


Treatments _Replications 


r alue | 

vino I „ „ v 
" • I n o p l X l 
and. [ 

of the < n ° Plk ° 




r-o p o k i 


3.905 

2.035 

1.342 

0.64 2 

2.005 

0.562 

0.345 

1.570 

0 .941 

2.155 

0.930 

0 .480 

2.49,2 

0.533 

0.725 

2.4 37 

2.651 

1.007 

1.961 

0.289 

1.902 

2.032 

0.767 

2.202 

1.622 

0.285 

1.705 

2.232 

1.206 

vltl. 308 

1.761 

0.267 


| n „ v 2.651 J-.ou/ ----- . 

n i p o k o 

^ 2.902 2.032 0.767 2.202 

1.622 0.285 1.705 2.232 

n i p 0 k l _ 

n p k 1.206 .‘a. 308 1.761 0.267 

ftialyle the data and interpret the result. 

2 Table 6.7 gives the plan and yield of Jute fibre ink, per 
,lot of a 2 4 field experiment on jute conduoted during 1966 rn 

, . . _ 10 tons per acre, Nitrochalk ■ with levels n Q - 0. 

D = °' d l - 10 tcns p v _ nhate p in the form of P 2 0 5 with two 
!= 0.4 cwt per acre, phosphate P i 2 levels 

evels P 0 = 0, Pl = 0.6 cwt per acre and potash 
_ = 0. k = 1.0 cwt per acre. 
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6.8 gives the yields of paddy in kilogram per plot in a 

if apl® * 

lattice experiment with 25 varieties. The variety nunber 
n W ithin brackets. 

. 6 gi ve , 8m yield of paddy in two replication with simple 

Tat>l e • * ]_ a ttice design____ 



Replication 1 
Yields 


(11) 

(12) 

(13) ' 

50.8 

88.2 

50.3 

(21) 

66.5 

(22) 

41.6 

(23) 

76.6 

(31) 

69 .4 

(32) 

71.0 

(33) 

59.1 

(41) 

82.8 

(4 2) 

50.6 

(4 3) 
67.3 

<51) 

68.1 

(52) 

72.3 

(53) 

65.7 


(14) 

73.9 

(24) 

45.3 

(34) 

39.7 

(44) 

51.3 

(54) 

46.7 


(15) 

49.2 

(25) 

83.8 

(35) 

64.2 

(45) 

82.1 

(55) 

79.7 


Block. 

1 

: 1 


( 11 ) 

55.4 

( 12 ) 

67.7 

(13) 
60 .9 

(14) 
58.1 

(15) 
49.6 


( 21 ) 

88.4 

( 22 ) 

50.6 

(23) 
66.8 


Repl icatio n_2_ 
v-j pi ds 

(31) 

75.4 

(32) 

68.1 


(24) 
60 .0 

(25) 
62.7 


(33) 

59.5 

(34) 

49 .9 

(35) 

ie lding 


(41) 

94.8 

(4 2) 
48.5 

(43) 

54.3 

(44) 
79 .8 

(45) 

52.7 


(51) 
52.1 

(52) 

70 .5 

(53) 

73.7 

(54) 

47.4 

(55) 
69 .8 


vari 


. o of paddy* 
eties or v 


—- 

the data to 

Analyze tne 

^ for t ^ ve 

. i V e app r ° a btaih^^ 

8EFEKENCES hat altera ^ lgB . a" 4 ° l3 . 

1 m<? 64) • A ral facto rla ftsS oc. BUi 

tas, M.S. ‘ 19 s yzme trica calcU tta Stat. 

construction <>* £actors , cal 

maximum nu JT '^ er 

pp.1-17. 
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.HAPTER 


WWStS'?'- Linear hypo Thesis 




s * w n 6; 

y ' N.V. 
la ncee 


1° this che P te r we shall 

h ypothesis of the univariate ' Wlth mu ltivar iate „ 

:r: L“ r:: r, .n~ 

variate case although Cm “ lo “ * ** mT* 

are quite involved * tr ibuti cn of diff er<an+ . . 

vea - As we have dm a ■ erent test criteri 

genera linear hypothesis we sh-li * *** ° aSe of univariate 

of the multivariate general “ ^ reat he « two dif (erent £orn 
through the multivariate analyst ™ their testing 

techniques. The firc-t variance (manova) 

while the second o,e" 

and is suitable fcr anal mU tlvariate regression model 

applications we !hlu t T ^ —l- As 

designs. ~ ' eat soma multivariate complete block 


r< *lens lml ' GENERAL LINEAR HYPOTHESIS (ABSTRACT FORMULATION) 


Let X a = (x v ) 1 rv - 1 

ai. ap ' a - 1 

distributed p-variate normal v 


N be; N independently 


vectors with means E(X a ) = y a 


r 'n / 

I- — (y < *** , ^ J fYn^ ' anc ^ a common covariance positive definite 

DUnaing i F 

rVm-s'l- -J v V T /x4- 


matrix 


Suppl. 


X = 


k ll' • 


‘hi ' * 


Let 

. .,x 1 

• • ( Xn 


/ • • • / 


/ • • • / X. 


= (x ,... ,x N ) 


‘n 1» * * * ,u ! 


Xy - (X u .X Hi ) • , Ei * <»*li.W ‘' 1 ■ 1 . P ‘ 17-1 

A multivariate general linear hypothesis model is defined in 

,_„_ QC . tt and it of dimensions s and 

terms of two linear subspaces w 

v. fhat 0 < s-r < s < N. It is assumed 

s-r respectively, such tha _ 

«.« tl . •„««•“ * *" a - *" 

tl» „„n h»otb..l. «„ «»« l "" i€ .« 

throughout that N-S i P s 

freedom to estimate ^• 


(7.1) 


iety 







VV 




BfT 
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As in the univariate case (see Chapter 3) this problem can 
be reduced to a canonical form by applying to each of the 
N-vector X., i = 1__ an orthogonal transformation 


X -► Y = OX 


(7.2) 


where O is an N x N orthogonal matrix 


°ii'...,o 1N 


°N1'* * *'°NN 


(7.3) 


such that its first s row vectors O = (O^^, ...,Oj N ), i =l,... # s 
span it with O r+1 , ... ,0 s spanning -ir^ . Write 

Y =(Y X __ Y^) ' , Y a = (Y al .Y^) 1 , a = 1 -' N * 

Since 

(Y°V = (0%.O a X p ), 

and 

s 

E(.X.) = y. = I a. 0 1 
_1 1 i=l 1 

where a^, i = are real constants and all are not equal 

to zero, E (Y a ) = 0^ for a- s+l,...,N if and only if all e it 

and E (Y a ) = () for a = 1, . . ., r, s+1, . . . ,N if and only if vk e 
Furthermore since 


Y . = £ O X., Y„. = E 0 Q , X 

ai y=1 ay yi 3j ^ 36 6j 

and Cov (X . ,X x .) = o . . when t = 6 and is equal to 0 when y ^ 6, 
yi oj l j ^ 


Cov(Y ai'V = 


= C. , £ O O 

iJ Y=1 ay °6y 


/ if a = 3 , 


0 , if a ^ 3 . 


Thus the row vector of Y are independently distributed normal 
^-vectors with the same covariance matrix £ and under ir fi 





( a ) = 


under 7 t 


MULT 




'(say), a = ! 


GENEp, 


a = 


s+ l,. 


al l 


near 


•: pbw. . qpm 


NVPqt 


HESls 


■. ' s 


E(y a ) =J * *' TA '---'S f 

Is * a a 1 

v - ' • * * »r, sH’i t N 

pence the canonical c 

as follows: v«. 

with the same covariance mat e P^snt ly a istrtbute , ve 

E(Y“) = 0, a = S+1 . . We " <P SltiVedefinite » -*ith 

hypothesis H Q : v a = E (Y a ) 0 a ^ erested ln testing the null 
the observations Y a on Y a ~ - 1 '“" r ‘ The ' : likelihood of 

, , ~i — ,a !»•••/N, which for given v a 

depends only in v 1 .y s , E is 9 1 • 

Mv 1 .v s , l) = (2nr N P /2 (det E)- N/2 x 

exp {-Sjtr [1 ( Z C -v a ) <z a -y a ) ' + l z a y. a ']) (7.4) 


a = 


• .,s, 


• /N. 


(7.4) 


a = l 


a=s+l 


Before we proceed to find the likelihood ratio test of H q we 
need to p»rove the following lemma which will be useful in the 

sequel. 

Lemma 7.1. Let A be a synanetric p * p Positive ^finite matrix 


and let 
f (A) = 


c (det A)"' ey *P 


^ exp{ - \ tr A } 


(7.5) 


where c is a cons— 

all positive definite 

identity matrix. 

Proof • Clearly 
p N / 2 

T 6 e 

f(A) = c 11 . 1 

i = i 


is a constant. Th *" where I is the p >• P 

itive definite matrices 


maximum in the s P c>ce 


P 0 N/i exp (- *4 0 i ] tbe matri* A- 

£(A) = C i=l 1 h racteristic -ots oncltt ae 

t he charact is easy on ly 

where 0,. 9 p ** wit h respect to ^ ^ hold s H 

Differentiating when 0 1 = ’" S 

that f(A) is maX1 
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if A = NX. Hence f (A) is maximum if A - NI. 
Using Lemma 7.1 we obtain 

(2Tr)"' Np/2 (det Z)" N/2 


Q-E.d 


1 s 

Hid.IK L (^ i • • • / ^ # £ ) ITlclX 

, n 


exp{ - ^trE- 1 (I ') f (I 1 ) - NP/2 ('Set 2 iV ') ' N/2 

a=s+l a=s+l 


exp {-NP/2 }. (7.6) 

Under the hypothesis'H q the likelihood function L(v a ,...,V s , £) 
reduces to 


L(v r+1 ,. 

. .,v s ,E) = (2 it) -N P/ 2 (detE) 

s 

„ ' N 

+ 2 

(z - Z a ) (x a -v a ) • + Z 

a=r-f 1 

a=s+l 


r 
[ I 
a=l 


Z Z 3 


z z 

(7.7) 


From (7.7) using Lemma 7.1 we obtain 
r-t 1 


max L(/ +1 ,.., ( v S ,E) = max(2-rr) ~ NP//2 (detE) N//2 

Z 


TT 


0) 


exp {-i^trZ" 1 [ Z x a z a< + 2 ' ] }= (^2L) ~ N P/ 2 

a —1 ’ N 


a=l 

Z~Z~ ) )" N/2 exp • {-N/2} . 
From (7.6) and (7.8) the likelihood ratio test of 


(det ( Z Z V' + Z Z V'))" N/2 
a=l a=s+l 


(7.8). 


= 0 rejects H ci whenever 


*y • • * - z r 


■ i _ / det b , N/2 . ^ , 

~ det (a+b) ± c or equivalently u = < c 


det(a+b) - 

- (7.9) 

where the constants c,c' are chosen in such a way that the 

corresponding test has size a and a = I y a y a ' b = 2 v a v“ 

a =l x ' --o*s+l X X 


This result is due to Wilks (1932) and Hsu (1941) and is 


popularly called the Wilks' criterion Prom Theorem 2.12, and 

Equation (-2.25) A = E Y a y a ' B - r v a v a ' 

a -i — — ' “ ~ Z YY are 

a -i a=s+l ~ — 


independently distributed Wishart matrices provided that N-s ».1 

- ~ P an ^ ^ ^ as a cen tral Wishart distribution with paramete 

and N-s degrees of freedom. Under H A has central Wishart 
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on^r it is distributee! TLLT.T** ° f freed °" »»ereas 


dlst ribution with parameter z 


in applications to specify 


noncentral Wishart a- 

"ishart distribution. 


str aiohtforward to carry out the reduc T “ *' 
juf .t described, explicitly. However thTl ^ Can ° nlcal **■. 
(with values u in (7.9)) can be expressed "\ 8t ' tlstic ° 

original random variable X by the f„), ■ S ° £ the 

Y ne following argument: 


s not 


always 


Let lii ^ii* • • •, P iN ) ' and 
the projections of the vector x. = 
.on ti ^ and tt 


oo 


respectively. 


—i 

For,each i 


i U>i j. 

il'‘*“' X iN^ 


(X 


iN ; 


be 


^ • • w p 


ai M oti^ ~ ^- x -i “ Ei^ 1 - M. ± ) 


N 

E (X, 

ot= 1 

is the inner product of two vectors, each of which is the 
difference of the: given vector X, and is projection jj. on tt 0 
and it remains unchanged under the orthogonal transformation 
of the coordinate system in which the variables are expressed. 
Now OX^ can be interpreted as expressing in a new 
coordinate system with the first s coordinate axes lying in tt^. 
Hence the projection of the transformed vector Y ± on is 

(Y . Y . ,0.0) so that the difference between the 

vector Y. and its projection on is (0, ..., 0, Y s+1> • * •» Y 

- ? Y a Y a ’ is given by 


Ni 


), 


Therefore the (i,j)th element of 


a=s+l 


( 7 . 10 ) 


_ difference between the projections of 
he same way the drfferenc . 0) . Thus I 

and tt is given by (Y u -' n a=l 

w . i-_ fforpnce of these 


ana n j .^ ~ - xj. 

qual to the inner product of the difference of these 
actions. m other words 




(7.11) 


y y - (y . — PLi) ’ ^i“^-i^ * 

1. nivariance^ ^^sformed^eotor X 

ft = term- 
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a=l,...,N, the problem of testing H q : a = 0 against A ^ 0 
remain invariant under the following three groups of 
transformations: 


(1) 


a=r+l,.. 


The group T of translations which translates Y a 

n Ot / i -.x. _ 


■ s # d u = (d 
— al 

the space of Y under T is (Y 


• d aP>' 


Y a + ^ 


e T., The maximal invariant in 


y r 

• > / 


,s+l 


,y n ). 


(2) Let Z be a r x p matrix such that Z' = ( Y 1 , . . ., Y r ) and let 
W be the (N-s) x p matrix such that W' = (Y s+1 ,...,y N ). The 
group 0(r) of r x r orthogonal matrices, operating as Z ^ oz, 

0 e 0 (r) , in the space of Z and the group O(N-s) of (N-s) x ( N _ s) 
orthogonal matrices, operating as W ■ OW, 0 e O(N-s), in the 
space of W affect neither the independence nor the covariance 


between any tv r o row vectors of Z and W. 


The following lemma will establish that Z'Z = £ Y a Y a is a 

maximal invariant in the spape of Z. a=1 

Lemma 7.2 . Z'Z is a maximal invariant in the space of Z under 
0(r). 


Eroof: Since for 0 e 0(r), (OZ) ' (OZ)=Z ' Z the matrix Z'Z 

is a maximal Z,Z* invariant if we can establish that for any two 
elements Z,Z* in the same space Z*'Z*=Z'Z implies that there 
exists a matrix 0 e 0(r) such that Z*=OZ.- Let us first consider 
r — P* Without any loss of generality we can assume that the p 
columns of Z are linearly independent (the exceptional set of 
Z's for which this does not hold has probability measure 0) . 

Now Z* 1 Z* = Z'Z implies that 0 = Z*Z ^ is an orthogonal matrix 
and that Z* = OZ. 

Now consider the case r > p. Without any loss of 
generality we can assume here that the p columns of Z are 
linearly independent. Since for any two p-dimensional subspaces 
of the r-space there exists an orthogonal transformation, 

^ rans forming one to other we assume that after a suitable 
orthogonal transformation the p column vectors of Z and Z* lie 

in the.same subspace and the problem is reduced to the case 
r = p. 




vastly consider the case r < p . The first r colurm vectors 
t i can be assumed to be linearly independent. Write z = (z z 
° e^e %i' Z 2 are svtos P aces of dimensions r x r and r x ( p _ r ) 

W C «ectively and similarly for Z*. Since Z*'Z* = Z'Z we obtain 

p&S * -jc ic-ie 

*,%* = Z^Z^, Z^'Z 2 = Z i Z 2'^2 Z 2 = Z 2 Z 2* Now by the previous 
^ u.ment for the case r = p Z^' Z^ = Z^Z^ implies that there 


• s ts an orthogonal matrix B = (Z* 1 )” 1 Z^ such that Z^-BZ^ A- 
^ 2 * = Z-Z 2 implies that Z* = BZ 2 - Obviously Z^Z^ZfcZj wit 


l\' Z 2 
z* 2 -bz 2 


as B is an orthogonal matrix. 

Similarly a maximal invariant in the space of W under 0 (N 
N 2.12. W’W is positive defi 
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orthogonal matrices °]/°2 sat ^ s ^y^ ng 

°l g l Ag i°i = °2 g 2 A * g 2°2 

which implies that A* = (g 2 ®l g l^ A ^ g 3^1^2 g 2 ) =QA.g' ,where 

g = g2 l0 2°l g l e G Jl (p) ‘ Furthermore from = °2 g 2 B *^2 0 2 

we get B* = gBg' . 

Q.E.d 

Let R^, .,.,R p be the roots of det(A-AB) = 0. Then Rj, ...,R 

are the maximal invariants in the space of (A,B) under G^(p) . 

Similarly a corresponding maximal invariant in the parametric 
1. s 

space of (v , ...,v , z) is 0,, ...,6 , the characteristic roots 
_ I 1 p 

of AA'E . The statistic U in (7.9) can be written as 

_ 1 P 1 

U = det (B (A+B) ) = n (1 + R.) ” 1 . (7.12 

i=l 1 

Anderson (1958) called this statistic, U p ,r,N-s. Several 
invariant tests are proposed for this problem and they are as 
follows. In all cases below we shall use the constant c as the 
generic notation of the cut-off point depending on the level of 
significance ot of the test. 


(i) Wilks' criterion (Wilks, 19 32, Hsu, 1940) 


Reject H q whenever u = det (a (a+b)" 1 ) = n (1 + r.) < c. 

1=1 (7.13 


(ii) Lawley's v (LaWley, 1938) and Hotelling's (Hotellina 

« a a • 1 O ^ 


19 51) criterion 


Reject H q whenever v = tr ab” 1 = z r = T ° 

i=l i 


N^-p > c ‘ ^ 7 - 14 


(lii)The largest and the smallest root criteria of Roy (Roy, 195*5 

Reject H whenever max r. > c 

< 1 — (7.15 


Reject H Q whenever min r i > c . 


(7.16 


lv) Piliai's statistic (Pillai,1955). 

-1 p 


Reject H c whenever tr aCa+b)" 1 ^ z (l+ r J" 1 > 

isl i - c * 


(7.1-5 
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From Stein (1956) or Kiefer and Schwartz (1965) the 
Hotelling's T 2 -test is admissible. From Giri and Kiefer (1965) 
the Hotelling's T 2 -test is locally and asymptotically minimax as 
< 5 2 q 00 respectively and most stringent. From Giri, 

Kiefer and Stein (1963), Linnik# Pliss and Salaevskii (1966), 
Salaevskii (1968) the Hotelling's T 2 -test is minimax (see Sectior 
2.4.4 and original references or Giri 19 76/ 1977 for details) . Vi 
shall discuss the pptimum properties of all the five test 
criteria in Section 7.4. Now we shall find the distribution of 
R-/...,R under H.. 

X P ^ 

7.2. THE DISTRIBUTION OF ...,R p UNDER H Q 

In theory one would be able to derive the null distribution of 
five test statistic as given in (7.13)-(7.17) from the joint 
distribution of R^, ...,R under Hq . Since for any g e G^(p) 

det (gAg'-AgBg') = (det g) 2 det(A-XB) , 

by choosing g such that gig' = I, we conclude that to find the 

joint distribution of R., ...,R under H Q we can without any loss 

of generality assume that 1=1. Thus the joint distribution of 

R./.../R does not depend on 1 under H n and depends only on 

®l'**" e p' the characteristic roots of AA'l under H^: A^ 0. 

From Theorem 2.12, the random matrices A, B are independently 

distributed as W (l,r) and W (l,N-s) respectively provided 

N-s p, r _> p. Note that the assumptions N-s _> p, r _> P make 

both A,B positive definite with probability one. Let r = n^, 

N-s = r \2 and let R^ > R 2 > ... > Rp > 0 denote the ordered 

characteristic roots of AB -1 (the probability of two roots being 

equal is 0) . Let 
% 

R. 

V i = 1+R. ' 1 = 1 '**" P • (7.21 

1 

Obviously V 1# ...,V are the characteristic roots of A(A+B) -1 / 
that is, the roots of det(A-X(A+B)) = 0. 

Rather than finding the joint distribution of R...,R 
directly we will find it convenient to find first the joint 
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gtr ibuti°n of V 1# .. and then derive from it the joint 
g l- r jjDUtion of R^»..» # R . 

Let C = A+B and let V be a diagonal matrix with diagonal 
elements V x .V p . We can write 

c = vrw' / a = wvw' 

^ ere w = (W. .) is a p x p nonsingular matrix. Obviously the 
W * rix w is nit unique. To determine W uniquely we shall require 

”L t w > 0. i = 1.. (the probability of = 0 is aero). 

writing j for the Jacobian, the Jacobian of the transformation 
(A B) (W,V) is equal to 

-*■ (W.V) 3 = JC(A.B) * (h.C) 1 * J[(h,C) **■ (W.V) «7,« 

JUA# , l From Olkin (1952) 

i It is easy to verify that «£B> **.«=> > (7>2J) 

I jUA c) - (W.V) ] = 2^ (det W) P l v i j 

I (2 25) the joint probability density fun 

From (2.^ ^ n > p) 


H n is (with 2=1/ n i - P ' 

U i\ /'i 


C n r P Cn 2’ P 


(det a) 


(n r p-l>/ 2 (det b) 


(n 2 -P-l>/ 2 exp( .!jtr(a+b) } 

(7.24) 


where 

. (n-1) p/ 2 *P lp - l,/4 

) c = P n r'((n-i- 1 >' /2) 

n ' p i=° _viability de 


TT r (n-i- 1 '^' , . 

C n,p ~ i=0 bility density of ( W ' V 

„ . he joint probability )/2 

Prom ( 7 .23) - ( 2 • 24) p ^-p-D/ 2 ^ ' 2 


C 11 Vi 

f (W.V) " C na/P n 2 ,P 1=1 

^' V 1 (n ,n 2 - 


i =1 (7.25) 

( n i+ n 2 -p)/ 2 eXp {-Htr 


X , b Pility aensitv action 

i < J . ,7.25) the prohah^ 1 . 

Integrating ° ut “ lP (n 2 -P- 1)/2 „ (v t - v j’ 1 

of V is given by p (hj-P' (1-Vj.) l<) (?>2#) 

fylv) = KC ni ,P Cn 2 ' p i =1 
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where 


K = (2ir) 


= ( 2 7T) 


p 2 /2 


2 /9 (n-+n 2 -p) /2 

/(2tt)" p /z 2 p Cdet (coo}' ) ] exp{-J$trwu)'}dw 


-2 /0 (n 1 +n_ 

9 /2 E Cdet WW'] 1 2 


(n 1 +n.-p) /2 


(7.27) 


and W = (W i j), the W^ are independently distributed normal 

random variables with neans 0 and the same variance unity. Thus 

(n.+n„-p)/2 

to evaluate ECdet WW'] 1 z in (7.27). We may use the 

fact that WW' = s (say) is a Wishart matrix W^(I # p) and by 
(2.25) its probability density function is given by 


f„{s) = C (det s) ^ exp {-J$tr s} 
° P# P 


(7.28) 


Note that (7.28) holds only for the evaluation of K as given in 
(7.27). From (2.25) and (7. 27)-(7.28) 


E(det WW*) 


(n.+n,-p) /2 
• \ 1 * 


(n-+n 2 -p-1)/2 

= C /(det s) exp{-istr s} ds 

P*P 


= C /C 

p,p n 1 +n 2 ,p 


Thus 


K = 


( 2tt) 


P 2 /2 


’ n l + n 2# p 


(7.29) 


(7.30) 


, 2 \ -1 


Since dV^ = (1 + Rp” dR i , from (7.26) the probability density 
function of R, a diagonal matrix with diagonal elements R 1# ...#R p 


p (n--p-1) /2 -(n.,+nJ/2 

f_(r) = C n r, 1 (1+r.) 12 n (r,-r.) (7.31) 

R i=l 1 i < j 1 J 


Where 
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p/2 ^ 

r(Js(n l +n 2 - i+1) ) 


C s p 

n r( H(n^i+i) r(Js(n 2 -i+i)) r( P +i-i)) 

i=l 


7.3- 


distribution of test statistics under h. 


Since all the four test criteria are functions of R the distribu¬ 
tion of all test statistics given in (7.13) - (7.17) under Hq can 
be obtained from (7.31) . However, the exact evaluation of all 
these distributions in specific cases are quite involved. Using 
(7.31) and computing various moments of U = u p/rf H_ s under ' H 0 
we obtain 


Ku*) = n (x*) , k = o,i,... 
i = l 


(7.32) 


where X r ...,X p are independently distributed central beta 
random variables with parameters (H(N-s-i+l) , Hr), i = 1, ...,P. 
Since U lies between 0 and 1, these moments determine uniquely 
the distribution of U under H 0 . Thus under H Q U is distributed 
as U = ? X ± . Furthermore under H Q U p#r#N _ s and U r>p#N _p_s+r 

na ve the “same distribution. In the special case p = 1. (N-s)(l-U) / 
ru has central F-distribution with degrees of freedom (r.N-s) . 

For p = 2, (N-s-1) (l-Ai)/r Jv has central F-distribution wrth 

(2r 2 (N-s+1) ) degrees of freedom. Schatzoff (1966) has given a 

nethod of obtaining the exact distribution of U p.r.£s “ l " fl 

r i nn ti as a sum of independent 

the representation of -l°g e u p#r ,N-s tabulated 

variables and taking successive convolutions and has tabulated 
variaDJ.e& 2 percentiles to exact percentiles 

the factor for converting X pr P d for 

0£ - (N -s- iu 0 ^: 

:r g iv: - - —~rr f 

U while Filial and Gupta (1969) have given both for 

U p,r,N-s . remove the restriction pr < • 

P < 6 and have succeeded t (1938) haS used inverse 

Unlike Consul (1966) , who following « - ’ > ^ 

•tellin transform to obtain the distribution r.p.N-s ■ 


. s • pillai and Gupta (1969) have obtain 
p < 4 as infinite serx^, ^ except when both p and r are o** 

the same xn finite both Qdd tbe series became inf init 

in the case when p an ded schatzoff’s tables for 

Pillai and Gupta have a (19 72) has studied the proba- 

n < r < 22 (see Table 7.4). Lee l u* 

11 - r - „ when one of p and r is even 

bility density func ion J?! Express ions obtained by earlier 
with a view to simplifying the express 

i ^omnutations. When both p and r are odd), 
authors for numerical computation 

. c terms of simple integrals and has 
has given expressions in terms F 

* square correction factors to cover, i, 
extended the tables of chisquare corr 

, » . ■,r=iiv=«? r, < r < 20 and pr < 14' 

conjunction of previous tables, — . 

£ r- i <= odd and is greater than 10 

with omission when one of p and r is oaa an 

Ma thai (197 la ,b) and Mathai and Rathie (1971) have also derived 
the distribution! U „ s using inverse Melltn transform and 
computed » and 1% point of !» *_.. For further references 

connected with the distribution of U p<rN _ s under H C we refer 
Pillai (1976) . 

Box (1949) gave an asymptotic expansion for the distributi 

of a monotone function of the likelihood ratio statistic 

x= fu ) N/2 when H n is true.. The expansion converges 

p, r, N-s u . _ 

rapidly and therefore the level of signmcance den 

from it will be quite adequate for moderate values of N. For 

large N, the Box result is equivalent to the large sample resu 

of Wilks (1938), that is, under H Q , -2 log X is distributed as 

*2 with pr degrees of freedom as N -*■ 00 . The Box approximati 

(with p £ r) is, under H Q , 


Pf-r log « p , r ,N-s i z) - P{x prl z} 


+ (Y/r 2 ) [P {x 2 r+4 


< z} - P{Xp r £’Z>] + 0 (N 4 ) , 


where y = pr(p +r -5)/48. If just the first term is used the 

total error of approximation is 0 (N ), if both terms are use 

-4 

the error is 0 (N ) . Furthermore if r < p we use the result 

that under H 0 U N is distributed as U „ . Rao 

v p,r,iM—s r, p, N-s-p+r 

(1951,1952) has given another approximation in terms of beta s< 




r the Lawley-Hotelli n 
med approximate upper 5 

° r p = ... # 8 and for 

8 ) has shown that under 

yfhpn N -► 00 


lAA l 

fer (19 


on R^» 


there 

A'Z ' 1 


(196 

eU in 


the va 


that th 
/er tr a 
h 
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are admissible for testing H Q . A = 0 against the alternatives 
H : A ^ 0. Thus, as a consequence of the following result of 
Anderson et al. (1964), they are unbiased for this problem. 

Anderson et al. ( 196 4) have given sufficient conditions on 
invariant tests depending only on R^,...,Rp for the power 
functions to be monotonically increasing functions of each 0^, 
i = l, ...,p. Further they have shown that the likelihood ratio 
test, the Lawley-Hote1ling 1 s trace test and the Roy's maximum 
characteristic roots test satisfy these conditions. The 
monotonicity of the power function of Roy's test has been 
demonstrated by Roy and Mikhail (1961) using a geometrical 
argument. 

Kiefer and Schwartz ( 1965) have shown, using the Bayes 
approach, that Pillai's test is admissible Bayes for this 
problem and it is fully invariant, similar and as a consequence 
of the result of Andetson et al. (1964) given in the preceding 

i 

paragraph, unbiased. Using the same Bayes approach Kiefer 
and Schwartz have also proved the admissibility of the likelihood 
ratio test under the restriction that N-s _> p+r-l f although the 
a-dmissibility of the likelihood ratio test can be proved without 
this added restriction (Schwartz, 1964b). Sinha and Giri ( 1977) 
proved the Bayes character (and hence admissibility) of the h 
likelihood ratio test whenever N-s > p. Narain (1950) has shown 
that the likelihood ratio test is unbiased. We refer to Nandi 
(1963) for a related admissibility result and to John (1971) for 
an optimality result. 

\i> 

7.5. MULTIVARIATE REGRESSION MODEL 

We shall discuss here the multivariate regression model of the 
general linear hypothesis. This formulation is very appropriate 

0 i 

for the analysis of multivariate design model. Let X = 

(X .,..., X )', a = 1, ...,N be independently distributed normal 

Ot JL OtJP 

p-vectors with 
E(X a ) = 6z a 
Cov(X a > = £, 


(7.34) 


/ nati v« 

es H t: 

Powe r 

f ® ac h e 
lh °od ra 

S ln-s _ , 


constants, B = (B^l is . p x «» Vetoes 

rn = ha “ assu ® that N-s > p “ brix °* unknown 
z =(z,...,z) iss. Lg t g _ ' 6 ran k Of the s 
3 ubmatrices of dime nsions ~ B l' whe « B „ 

are interested in testing the , dnd P x (s ' r > resp 
o . ^ the hypothesis h 

where Bj^ is a fixed matrix, n and v °’ 6 1 ° 
the dimension of „ q ts . and ^ of ™ 

Eor :::::r • The iikeuh °° a ° f «» 


- B^Y0s 

this 

c onseque nee 
preceding 
Kiefer 
he likelihoi 
lthough the 
oved withou 
Giri (19 77 
of the 

0) has show 

r t o Nandi 

n (19 71) f° 


- rr 

1 & noc ^ 

,.„ted nor 


. a A Ov _ — ( E 

( X “. 6 z a ) (/-ftS ’ ‘ N g» 


I 125 
= 1 


2iu-Np/ 2 (det( S ^ ' 
( — •*) 1 









4‘v 








^ ■■ 

K ' ' 


analysis OF 


= ( z ail' 


a _ x a _ > , 
Z ) X = - 1 ( 1 ) 

•' ar 


$, = 1/ • • • t N • 


(7.41) 


Then under H q , 

“ = x°- 6 °. Z?,<- “ = 


J* = X - «i z (l)' “ . 

, normally distributed with E (Y a ) = 8 2 z (2) and 

are independently normally 

.. v v Partition 
the same covariance matrix h . 


3=(3 r 6 2 ). z=U 2 


, C = (C 1# C 2 ), A = 


A (11) ' A ( 12) 


( 21 )' "( 22 ) 


where Z is a r * N submatrix ofZ, C^isap x r submatrix of c, 

A is a r x r submatrix of A. The likelihood function under 

til) 

H can be written as 
o 

N 

L(B 2 ,E) =(2ir)~ Np/2 (det Z)~ N/2 exp{-^trZ" 1 ( 2M£ a -3 2 z^ 2) ) 


, a . a . . 

& - tJ 2 *(2> ) >>• 


(7.4: 


From (7.43)^ it is thus obvious that the maximum likelihood 

estimates $ 2 , and £ of $ 2 and Z respectively under H are given 
by ° 


-1, - CC, . (la)) 


Z = — £ 


■ «ix.'f“' S 2 2 (2)> <l a -e 2 ^ 2) ) • 


If 


" " ^ “‘W ' i 2 Z (2)-» <is“'S° ,» J 2 a , , 

_ 1 (1) m 2 (9\ ; 

Lemma 7 a 


* * - « + <Bi-#;)., a 


<U> (12)A (22) a |„,)(Lp 0 l , 


(7.4 






Proof _ 




6A = <B 1' S 2>( (l «- A ( l2) 


■ 

- V 




+3-,a 


(21 >' a ( 22 , ) ' 


2 A (22)) 


ye obtain 


^2) 


1) * A 
1) ' A 


< 12 ) \ 
( 22 )/' 


f 2 = S(1)A(12 > + % A , 22) . 

t::i A z positive *«■*. 


(^rom 


^matrix 

fun cti on 


- S 2 Z ( 2 )» 


•42) | 6 2 = C 2 A U 2) - B...A, 


c °nsideration), a -1 
' ( 22 ) 


.-1 


° f c, 

under 


Now, under H q , usin, (7 ., 7 , _ (7 . 4s) ‘ ‘ 

(x-3z) = U-h) ♦ (g‘ rgj)lj + (6 r e° )Zl 

= (x-Bz) + (B 2 -B 2 )z 2+ (S r6l O):. (i , ria2)A -l 2) l2 ,. (7 . M 


(7.43) 


, . , 1“ A (12) A (22) Z 2 )Z 2 A (12) A (12) 0 ' 

Likelihood V\ 

r H Q are given J[ x _$ z )z' = xz'-xz' (zz') -1 zz' = xz* - xz' = o . 


> A (22) 


f Si nee (7.49) implies that 

! (x-Bz) z'. =0* 1 = 1,2> 

we obtain from (7.«) - (7.50) ^ 

+ CB 1 -e? ) (A (11 )- A (12) A C22) a ( 21 > 1 1 1 

„ 3 _ I = lx-Bz) +(8r 6 l ,(Zl 1121 22 


Brttf i 
M". 
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1 o 

=(x-3z) (x-3z) ' + (A (11) - A (12) A (22) ^ {21) ) (e r e 1 ° ) ' 


Q.E. 


13 


Using (7.40), (7.47) and Lemma 7.4 we conclude that the likelihoo 


ratio test of H Q : ^ f 3^° ’rejects whenever 


4 

i 1 


det ( Z (x a -3z a ) (x a -3z a ) •) 


det ( E (x -30 U -30 •+( Bj^-3^) (A ( 11 ) -A ( 12 ) A ( 22 ) A (21) J 
a=l 


(Bj-B*! 0 )') 


where the constant c depends on the level of significance a of th 
test. Wg shall now show that the corresponding statistic U in 
(7.51) is distributed as the statistic U in (7.9) . Wilks ( 19 32) 
first derived the likelihood test criterion for the special case 
of testing the equality of mean vectors of several populations. 
Wilks (1934) and Bartlett ( 19 34) extended its use to multivariate 


regression model. In what follows for notational simplicity we 
shall use the same notation for the estimate and the 



estimator 


For the maximum likelihood estimator 3 of 3 


E (3) = E ( Z X a Z a 'A _1 )=BAA" 1 = B . 


The covariance between the ith row vector 3 . and the ith row 

A A J J 


vector Bj of 3 is 


E <!i - li> ( - lj)' 


= A_1e ‘* 1 (X ai- E(x ai>>£“ ( ! < X Y J- E <\ 1 ))£ Y, )A - 1 


pp 



leorem 7.5 . N £ 

A 

idependently of 3 


“ / 

N 

= 2 x a x a - 

a=l- 

as W (E,N-s). 

hr 


A A 

BA B' 


is distributed 


Proo f. Let F be an s x s nonsingular matrix such that 
I and let E^ = Fz. Then E 2 E 2' = Fzz'F' = FAF' = i. This 
lplies that the s rows of E^ are orthogonal vectors cf unit 
ngth . Thus it is possible to find an (N-s) x n matrix s 

E i IN 

at E=( _ ) is an orthogonal matrix. Let Y = (Y ,... # Y ) = 

L 2 

us the column vectors of Y are independently distributed nc 
ctors with the same covariance matrix l and 


Y) 

= BzE' = BF 1 E 2 (E i ' E 2 j = (0,e * 1] * 


nee 

1 _ 

N i ^ „a v 0 ( 1 

£ X a X° = YY' = J I i 

0=1 " “ 0=1 

(7.55) 

/\ 

6' 

= XZ'A-W^X' = YEEIF' 1 )' A-V^jE'V 



1 

^ a a 1 

= Y< 0 > (o,dv = i ,n 

I a =N-s+l 

(7.56) 
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A N-s , 

n e = e . 

a=l 


where Y a , a = l,...,N-s are independently distributed normal 
p-vectors with means 0_ and the same covariance matrix E . 
Thus from (7.56)-(7.57) N I is distributed as W (N-s, I) 

A * 

independently of 3 . 



Q.E. 


TheorenLJLjS. Under H 0 (S r B 1 °) (A (u) -A (12) A’J 2) A (21) ) 


(3^-5-j 0 ) • is distributed as (r, E) (independently of N E ) . 


Proof . From (7.54) the covariance between the ith and 
rows of the estimator is 6 ±j (A (11 ,-A( 12 ) ^ 22 ) A (21) ) " 1 . 


jth 


Let B be an r * r nonsingular matrix such that 


B (A (11)" A (12) A (22) A (21) )B ' “ 1 


A -j 

and let ej-3® = YB = (Y , ... ( ,Y r ) B. Then 

( ®l _e i } (A (11)" A (12) A (22) A (21) ) ( ^l~ e i°) ' = Z = YY ‘ . 

j-M— 1 


Under H q , since E( 3^) = 3^ # 


E(Y) = E((3 1 -3 1 °)B’ 1 ) = 0 


I^t the ith and jth row^of Y be Yj respectively and let 
ith and the jth row of ^ be 3. y respectively. Then 


the 


(11)~ A (12) A (22) A (21) )-1b 


■^1 I / * * „ - 1 . . — 1 _ J 


=a ij ( B")' ( A 


u.. ^ (B (A,,, x - A/ A,L> A,„.i ) B*) “ ■*■ 


(il) (12) ■( 22) A (21) 


=°ij *• 


IhUS Jiis distributed as W p( l,r) <r>p) when H Q is true. 


independence of % follows f rom Theorem 7.6. 


Q.E 





MUL I I VAR | /\TE GENERAL unear hypqthes|s 


*■ APPLICATIONS of multivariate general LINEAR HYPOTHESIS 

-* e a fiXed set of v treatments, experimented conditions or ‘ 
jjasonistic classifications, etc. are of interest to the 
gjtperinenter and in the presence of each sore p response variates 

b ® recorded on the sampling units. These ireasurenents will 
ass,OTed to be inde pendent observations on v p-dimensional 
normal populations with different neans u°, a = 1,.... v but 
with the sane positive definite covariance matrix I . Let 

—ij ^ X ijl' * *" x ijp^ ' ^ “ L,...,n^, i = 1,..., v (N^ > p) be a 

sample of size N^ from the ith normal population with nean 
J t and covariance matrix I (positive definite) . we are 
interested here in testing the hypothesis 


V .H _y — _y (say) . 


In the notation of Section 7.1 it is a multivariate linear 

hypothesis with s-r = 1, N = Z N ± , s = v . Define 

i = l 


N i*i = 1 Nx = ^ Z x s = I (x - x, ) (x 

j 1J *• i j 1J 1 j 1J _1 * “ 


- x. ) ' 
ij -l. 


(7.58) 


s = Z s 


The likelihood pf the observation vectors jr j = 1, 
i = 1/ ..., v is 


, Z) = (2tt) " n p/ 2 (detZ ) N//2 exp{-J$trZ“ 1 (Z Elx^-^) 


i J 


')>• 

(7.59) 


The parametric space is given by 

0 = { (y l , ... , y v , £) ) 



nd it reduces to u) = {(ji,£) } when H Q is true. Using Lemma 7.1 
ax Ltjj 1 , .. . ,jj V , I) = max (2 tt) ~ Np/2 (detZ) " N/2 exp{-HtrE X b) 


= ( 2l)- N P/ 2 ( det s)" N/2 exp{-Np/2}. 


(7.60) 



* 
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Under H the likelihood function reduces to L(jj, £) and 
o 

ns L (V. I) -max (2 tt) "^P /2 (detZ): _ ^ /2 exp{-^trE - ^( £ 
u “ Z A J 

(X. .-X )) } 

—xj 

= (2jT)-N p /2 (d et(z E(x -x )(x..-x ) •) “ N/2 e xp {-Np/2 }. 

N . —1J • • x J • • 

1 J (7 * 


Since 

£ £ (x. . 
i j “iJ 


X ) (X; . - X ) 

• • lj • • 


= £ £(x. ,-x. ) (x. .-x ) 1 + £ N. (x. -x ) (x. -x ) ' 

i j -ij -x. -ij . ± 1 -l. -i. — 


we obtain from ( 7.60)-( 7.6 1) 


maxL ( m, £) 


max L (jj* , ... ,jj , Z) 

n 


r det s 

[ --- -]• 

det (s+£ N. (x. -x )(x. -x ) 4 ) 

•i "*■ • **• • 1 • • 


= r det (s) n /2 

L det (s+a) J 


where a - Z (x ± -x_) (x ± # ) 1 • Thus the likelihood ratio 


test- of H q rejects H q whenever 


u = < k 

det (s+a) 


(7.62) 



where k is a constant depending on the level of significance a 
of the test. Though from Section 7.1 it follows that under H 
the statistic U (with values u) is distributed as u ° 

we shall present below a direct proof of this as it‘wuJ'teip us 

i°^ e ' OP a bettar unt3erstan <3ing of various distributions 
involved in two and higher-way multivariate classifications. 

From Theorem 2.12, A (h^inr, = * 

z, a ibeing a function of x i - i v) 

«d 8 (with values s, are independent. Let ^ . 
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.*»i.*VI.x^) 

and let the matrices A a and A M u 

be given by 3 ' c dimension N x n 


A 3 


1 1 
N' • * *' n 


_1 

N ' * * * •' 


h 2 ~ B ' a 3 < A = I-b 


where B is a diagonal matrix in the block form with diagonal 
elements B ( jy < • •.,B j vv) , B (il> being a ^ x matrix each of 
whose elements equal to 1/N i and I is the N x N identity matrix. 
Obviously A i A^ = A i# i = 1,2,3. In other words A 1 ,A 2 and A 3 are 
idempotent matrices. Thus 

rank (A 3 ) = tr A^ = 1, rank (A 2 ) = tr B - tr A^ = v-1, 
rank(A^) = tr I - tr B = N-v . 

Furthermore A^A^ =0 (i ^ j) . Now 

Z Z (X.J - X ± _> (Xij-Xi.) ■ - »!*■• I «i ‘Xi.-X.J (Xi.-X..)' 


= XA 2 X'. 

Since A is idempotent of rank N-v there exists an orthogonal 

. f , a F = ( I(N-v),0 ) where ! (v-D is the 

matrix F 1 such that Q 0 


(v-1) * ( v 


- 1 ) identity matrix. Hence 

N-v 


XA,X' = (XF.MFiA^) (XF,) ' H 

,1 v N l = v and x“ = < Y al. Y <*p’ '' 

where (XF^) = (X #•••<— 

a = 1# . • • /N • 

■ v -X #•••'—vN ^ 

x -X. , •••'-£vl -v. vw v 

= (x u -ii.. x w 1 -i- 

we obtain E(XA^) 

. ) = E(Y)F^A 1 = 0 
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which implies that E (Y) F^l^l = * • • • E (Y N V ) > 0.» -.., 0) =o 

Since is an orthogonal matrix Y^, ...,Y N V are independently- 

distributed normal p-vectors with mean JD and the same covariance 

matrix Z . Hence XAX' is distributed as W (Z,N-v) . Similarly 

P 

since is an idempotent matrix of rank v-1/ 

^2~ 1 ' * * * ' — 1 t • • • t 21^ -X ' * * * '21 -v; “21 ^ 

and E (XA 2 ) = 0 under H q , we conclude that XA 2 X' is distributed 
as W p (Z,v-l) when H Q is true. Thus under H o U = det S/det(S+A) 
is distributed U p N _^and we have discussed its distribution 
in Section 7.3. 

In the setting of multivariate regression model the 
mathematical model of the multivariate one-way classification 
can be written as 

E(Xij) = ez lj , j = i = , 

C° v (X ± J) = Z (positive definite) 
or equivalently: 

E(x) = $z, Cov(X^j) = E (positive definite), (7.63) 

where 6 is a p x v matrix of unknown elements, z is a v x N 
matrix of known elements and are given by 


V = (jJ V , ... ,jj V “ 3 '-jj V , u v ) , 


2 ..v 


* ■ fen 


.-1N/---'^V1.i V N > ' 

J- \) 


1 .. 10 .. 0 .. 0 . . 0 0 ..0 
0 .. 01 .. 1 .. 0 .. 00..0 
o . . 0 0.. o 


0 .. 

1 .. 


with z A . 


0 0 . 
1 1 . 


0 .. 1 

1 .. 1 


(z ij l'***' z ij i = 


• ,* ± , i 
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The hypothesis H 0 reduces to H a 
- --t.i''- 1 - M v ). o : ®i ~ ° where |3 = ^ ^ 

Note that this representation is not the , 
this case. One other natural rpnr . y P° ss ihl e ln 

P V - J. ) with vu = i uC a ! . atl ° n is 8 ■ (M 1 - y . 
reduces to's = 0. 1 hYP ° theSiS H o 

The likelihood Of the observation x on x is given by 

L(3,£) = (2TT)" N P/ 2 (det v \ -N/2 , 1 _! 

(aet exp {-Jjtr £ Vbz) (x-6z)■} ( 7 . 6 « 

Elince the rank of the v x N matrix z(v < N) ±s v< 
and the unique maximum likelihood estimate of 6 is 

A. 


3 = xz'(zz') 1 = (x -X X _v 

~1. —v.'— 2 . —v .' * * *v-1. —v.v * (7.65 

From (7.64) using Lemma (7.1) the maximum likelihood estimate of 
l is 


I is 


E = i(xx--S(zz)B-) = i I E(x -x )(x -x )■ 

j_ j 1 J _L# -LJ -*“• 

Under H q the maximum likelihood estimate of Z is given by 


(7.66 


Under H q the maximum likelihood estimate of Z is given by 

A A 

Z = Z Z (x. .-X ) (x .-x ) 1 = Z + Z N. (x, -x ) (x. -x ) ' (7.6 

j 1J • • 1 J "" • • ^ ^ • •• 1 • “• • 

From (7.66) - (7.67) the likelihood ratio test of H Q rejects H q 
whenever 


U = det (5.1 - < k 

det(a+s) — 


(7.68) 


*here k is a constant depending on the level of significance of 
the test and the corresponding statistic 0 is distributed as 


j when H is true, 

p,v-1,N-v ° 


c^Hon 7 1.1 other invariant tests of H £ 
As discussed in Section /.i.i 


ire as follows. 


. Reiect H whenever 

L) Lawley-Hotellihg's test criterion. Rej 


tr (a k • 
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(ii) Filial-s test criterion:. Reject H q whenever tr(a(a+s)- > k . 

(iii) Roy's test criteria based on the maximum and the minimum 

characteristic roots of a(a+s) 

If we reject H q by using any one of the four test criteria 
discussed above we conclude that not all the mean vectors are 
equal. Naturally such rejection leads to further investigation 

to decide which mean vectors are different. 


Since, 


mean 


/N.N. 

(x, _x. ) has a p-variate normal distribution with 

N i +N j 1 * J * 


N.N . 
i 1 


N.+N . 
i J 


(y i _yJ) and positive definite covariance matrix E and 


the random 


N.N 

l 


matrix S is distributed independently ofj (X^ -X^ ) 
KF XT 1 J 


N.N. . 

W (E,N-v), -x. )' S (X. -X. ) is distributed as 

P IN . "rJN . 1 • J • 1 • J • 

2 J 

Hotelling's T statistic with N-v degrees of freedom and 


N.N . . . 

noncentrality parameter J ) 1 E" 1 ) . Thus from 

i j 

chapter 2 the optimum test of level a of the null hypothesis 
jj 1= rejects it whenever 

(N - v)( N^ )( *i.-V' ^t I: - v^ +1 ) f i^,p,n-v- p+ i- 

Note that from Theorem 2.14 under the null hypothesis ^ N ~ v ~Pf—T 2 
. p(N-v) 

has central F-distribution with (p,N-v-p+l) degrees of freedom. 

thus on the basis of such comparisons for all possible pairs 

'H ) (i^j) it will be possible to divide the v populations 

into different groups such that the populations of the same group 

have the same mean and those of different groups have different 
means. 

7 * 6 * 1 * Experimen tal Design Model of the Multivariate One-way. 
C lassifications : We have discussed in complete details the 
univariate one-way classification in Section 3.6. We shall 




UNE « hvpothes, s , 

>n sider here a formulation of v 

classification in terms of multiy! mUltivari ate one-way 
the design matrix z does not regression model „ here 

ultivanate regression mode! (1m £ ^ ***** tte 

-lj' 2 (positive definite) 

where 6 = (p, a 1 .„Vy and ( 7 ., 


1 1 . 

1 0 . 
o 1 . 

0 0 . 


The matrix £ is of dimension p x (v+1) and. the matrix z is of 
dimension (v+1) x N. Obviously the rank of z is v (assuming 
v+1 < N) . 

Clearly (7.69) can be obtained from (7.63) writing 

jj = v ^ £ jj^ , a 1 = - _y, i = 1 ,..., v and in this obviously 

i 

£ a 1 = 0. 
i 

As in the univariate case all linear combinations of the 
a 1 are not estimable. To see what kind of linear conbinations 
of the a 1 are estimable let us consider the estimator £ £a..X i: 
where the a^ are reals with £ = c i (say) . Since 

E(ZIa X > = ^- 1+i( i Ci> 

i j J 1 J , 

, y c a is estimable 

n n anv linear combination l c^ _ 

we conclude that any i 

r r is a contrast. In 

if l C i = 0* In ° ther W ° r i 1 - i J Wj) are estimable. 

particular all element /. 

But neither I / "or any one ^ ^ restricti0 n 

_ 1 _v i ~ 1 However, u we ^ 
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E o 1 = 0 then the unique estimate of ji.o . i <- 1.v are 

i 


x , a 


x. -x , i 

•"X • • • 


1= x , a 1 = x ± - x , i = l/..»/ v « (7,-j 

Thus for the model (7.69) with the added restriction that E £ 
the unique estimates of ji and the a 1 are given by (7.70) , l I n 
this case the hypothesis H Q :a 1 =...= a V reduces to H^all a 1 = o 
and the likelihood ratio test (Wilks' criterion) of H q is given 
by (7.6 8). 

If H is rejected it is necessary to. test estimable 
hypothesis of the form : E c^ jot 1 = _6 where E c^ — 0. and S. is 
a fixed vector and the appropriate level a invariant test of 
is given by. 

Reject H* whenever, 

(N-v) X (Z c^ -_6) 's 1 (2 > ^N-v-p+1) F l-V/P,N-v-p+l, 

where 

X -1 = E (c. 2 /N.)'. Note that under H* 
i i o 


(N-v)X(E c. X -&) ' S" 1 (E c. X. -6) 
i 1 I* i 1 1 * 


2 

is distributed as Hotelling's T -statistic with N-v degrees of 
freedom. 


7.6.2. Multivariate Two-way Classifications with Sinale 


Observatio ns P er Cell ; The experimental set-up is the same as the 
univariate two-way classification as discussed in Chapter 3. 
Suppose that we have a set of independently distributed normal 
p-vectors X = (X, X )', i = 1,...,I, j = l,...,Jwith 

—ij iji ljp 

8(2^) = y + a 1 + Cov(X i:j ) = E (7.71) 

where E is a positive definite matrix, ^ = (y 1# ...,y ) ', a 1 = 

(a , ...,a ) • satisfying E a 1 = 0 and B j = (6.3. )' 

satisfying E =0. With this model the hypotheses of interest 





m 


“ - ~ ' a, . aii - ‘-'neaf 

r t-he ^ ^ 911 8 J _ „ ’ “'"ESlj 

in the terminology ~ ~ 2. . 

hypothesis with r = Se tti otl ?>i 

Similarly Hie ~ 1 ~ 1 « s = "a is a mui,.. 

T * 6 ls a mult,,, 1+J ~1 ana » ltlv atiate „ 
s = I+J-l an d N _ a _ tlv «iate Un d N ' s » (I-n ,! Une; 

observations x ~ <J-1) 6arh ypoth esi ' 1)l 

! 1S9iv ->y oTZ c " 3 - 

MjJ.a 1 - - 1 -1 he 


M£ '- ./,D 

= (2m) p/2 ( det j.j-U/2 

e *P {-Jjtrr 1 ^ i( x . i j 
* ^ij-h-a 1 -^) ■) } i j "i > 

Under tt the* 

a e maximum likeli hood .. 

H = x , a 1 = v "j '1 ar « 

• • ~i. ~3S * 8 = x -y r, 

1 t'\ ~ --j • By using Lemma • 


i = X. . a 1 = x . _ x ;j — °* , 

TOX T/i 1 T * * ■. ~ ■’•j * By Lemma 

“ L( ^.£ -e 1 ./.I) 

= max ( 2 *r IJ P/ 2 ( d et Z)-^/ 2 exp{ . !5trr l £ ^ _ x 

i j 1 * 

* ( -ii _ -i * i +x )•)}. 

= (■—) _IJP//2 Cdet ( Z Z (x -x -x +x ) (x, .-x. -x .+: 

u —i j —i. —. j - 


J -L . —. J . 

= (■—) _IJP//2 Cdet ( Z Z (x. .-x. -x +x ) (x, .-x. -x 

Id i j “I- J -1J “1. 

xp{-IJp/2). 

nder H the maximum likelihood estimate of £ a 

ct 

= X 7-x and 

• J • • 

1 j , 

max L (jj / 8, / • • • /£ ' 

a 

-IJ/2 

( 2l ) -Up/2 Ldet( Z ^ (x i j"^.j ) ( -ij'-.j * } ] 

IJ i j 


-IJ/2 


exp(-iJp/2) 


l l 
i j 






EFWm*' 

HK'- ■- 
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where 


+x ) 


c = Z l . } -*• 

^ i j 

a = £ J (x. -x ) (x H -* ) ‘ • 

, , «r u rpisets H whenever 

the likelihood ratio test of a J _a 


v«M 


U 1 = 


det c < 
det(a+c) — 


where the constant k depends on the level of significance a of tfa 

test. Under H a the statistic (with values u.^) is distributed 

as u “ As in the case of one-way classification 

p, 1-1, (I-1) (J- 1) 

it can also be verified directly that the random matrix 
c = 2 z (X..-X. -X . + X^) (X-.-Xi.-X.j + X..) ’ is distributed 

independently of A = 1 J (X- -X ) (X-s - X ) 1 as W (2, (1-1) (J-l)) 

r ^ Jj • • • -L • • • Jr 

and under H A is distributed W (2,1-1). Similarly it may be 
a P 

verified that the likelihood ratio test of rejects whenever 

det c . -i 
^2 det(b+c) - k 

where the constant k depends on the level of significance a of 


the test and b = E I (x .-x ) (x .-x ) 1 . The statistic U„ (with 

J • J “ • • .J””.. z 

values u ) is distributed as U . , T ... , ... The Vi 

2 p, J-l, (I- 1) (J-l) 

matrix C is independent of B = E’ I (X .-X ) (X ,-X ) ' 

j *J •• • J • * 


The Wish art 


In order that C,B,A are.positive definite with probability 
one it is necessary to have p<_I-l and p<J-l. Other invariant 
tests of H a and can be obtained following (7.14) _ (7.17) . 

In terms of the multivariate regression model the mathematical 
model of the multivariate two-way classification can be written 


r 


E(X) 3z, Cov(x i .) - £ (positive definite) 


(7.75) 


where X = (x . y v . 

a 1 is a p x ij matrix, 

p = (u.a . c„J* • 

.- ' £ . & 1 ls 3 p x (I+J+l) matrix and 

' ....ill.5„) 




U1) (J-l)) 

Lt m ay be j 
H g whenevei 

ince a of 
.c U 2 (with 
[ishart ^ 

obabili^ I 

vari ant I 

I 


B i ® ^ (I “hJ* -f" 1) x T T . 

I ' Ij Of rank J. + J x 

-«— y classif 

I a j-ejected it is necessarv . J , 

I the form H* • r a ryi ^ ^ t( ^ st hypotheses of 

a ' 1 1 5 - Where the a i *» real' constants satisfy 
“* I a. = 0 and 6. is a fixed vector. Let r‘ . z a 2 ^ 

yi% 1 

. a i—i . has a P"variate normal distribution with nean l a 

i 1 

^ and positive definite covariance matrix Z and the random matrix 

wWstributed independently of \' 2 Z a.x. as w (Z, (i_i) (j_i)) we 
I . j_ 1 1 • P 

[obtain from Theoretn 2.14 that 

' (1-1) (J-l) A (Z a. X. -6) • C _1 ( Z a X -6 ) 

i • i 

is distributed as Hotelling's T 2 -statistic with degree of freedo 
(I_x) (J-l) and the non-Wntrality parameter X( E a. a -6)'I 

( J. , i s) . . Thus the optimum invariant test of rejects 

i i ~~ 

H* (at level a ) if 
a 

m . c ' l <l a.x. -i) 

(i-d(j-i) a ( ? a i *i.~- x 1-1 * 
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Similarly if Hg is rejected the optimum invariant test of h 
£ a gj _ y , wFere the a. are real constants satisfying 

Z aj = 0 and ] is a fixed vector, rejects H^| if 

(1-1) (J— 1) X (2 a X ,-j) ' c _1 (Z a x^-j) 

• J -J -J 


. q-i) (J-Dp P 
- (I-l) (J-l)-p+l F l-a,p,(I-l) (J-D-p+l 


.n-1 


where y ^ = Z a, /I. Note that (I—1) (J—1) y (Z a . X i~ J() 1 C 
j J J J * J 

2 

IT. a.X . —y) is distributed as Hotelling's T -statistic with 

j J “* J 

noncentrality parameter 

y = (Z a. i j -Y) ' Z _i (Za _§ J -Y) . 
j J j J 

7.6.3. Multivariate Two-way Classifications with K Observations 

" • 

Per Cell ; Suppose that we have a set of independently 

distributed normal p-vectors X. ... = (X. X. ,, J • 

—ljk ljkl ijkp 

i 1/ • • • / 1/ j k — 1, . . ., K, (K ^ 1) with 

E(X ijk ) = _ti + i* 1 + + J 1 ' 1 / Cov(X.^ ) = 2 (positive definite) 

where 

.H — (y i # ■ • • / M ) > Q. — ( ct. ct. )' w i th Z a 1 = 0 , 

i n ii ip —' 

l 

— J = i' • • • • £,- p ) ’ with 2 = (y , .. y, . ) 1 with 

i . J . JP j ijl ljp 

^ — “ j = 0. The Y ^ are interaction between the two 

factors of the two-way classification. In the notations of 
Section 7.1 H a is a multivariate linear hypothesis withr-I-l,s 
I J, N-s=ij(k- 1) and similarly for Hg. The hypothesis H is a 
multivariate linear hypothesis with“r = (I-l) (j-1) , s = IJ, 

^ IJ(K-l) . Under the maximum likelihood estimate of 
—' 2l • ^ > I Z are given by 



B J = 


x 


V 




JK: -ij.‘ 


s = (IJK) t Itu 

1 j k -ijk ' 4 ) (x 

Under the hypothesis H 

\L» J. ' JC ^ and r e ^ximum u > . 

, *. Xare ^by llke Uhood es tlrate of 

' = i ...^=i r * Hi 

* J - 1= (v 

i = (IJK)-l I ; j 

i j k -ij k '%j. + %..- iS ) (x . 

[«t ■ • • -ijk %j ,+St -x ) 

' * • • • 




k 


** 'W 

I 


t i -i -•'»=> as Ho., • 

-' 1 ana * - C ^ 


(J = X 


oa «»tinat e of 


G = Z Z Z (x _v w 

1 j k -ijk -ij.) —ijk~^ £ j,> 


D = Z Z K (x _Y v 

i j -i-j. -i..- X .j. +k ...)(2 iJ .-x i __-x + x 

* J " • • • 


B = 2 IK(X -X ) (X . -X )■ 
A = Z JK (X -X ) (x. -x ) • 

j_ A.. • • • 1 .. —. . . 


and let g,d,b,a denote their sample values. The random matrices 
G f D,B,A are independently distributed Wishart matrices as 
P £ I- l/ p <_ J-l by assumption. It may be easily checked that the 


likelihood ratio test of H rejects H if 

a J a 


_ det q v 
J 1 det(a+g) — 


tfhere k is a constant depending on the level of significance 


of the test under H q the statistic U^with values Uj) is 


distributed as u p# i_ l, IJ(K- 1) * 

Similarly the likelihood ratio rest of H £ rejects H, 


whe ne ve r 


det g 
det (b+g) 


< k 


16 , on the level of significance u of 

the constant k depen s ( wi th values u 2 ) 1S 

the statistic u 2 


and under H, 


Wk 
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distributed u p/ j_i,iJ(K-l)* 
H rejects whenever 


The likelihood ratio test of 


- -;- < k 

u 3 ” det (d+g) - 


_ det g 

where the constant k depends on the level of significance « ot 
the test and under H the statistic Uj (with values u 3 > is 

distributed as U (J _ 1} , IJ (K-l) ■ ° ther tests 

H and H can be obtained following (7.14) - (7.17) . 


of H 


a J3 Y _ 

If H is rejected one would be interested m testing 


hypothesif of the form H|= ? a. j 1 = J where the a. are real 
satisfying E a ± = 0 and 6 is a fixed vector. The optimum 


invariant test for this problem is to reject H* if 


X 1 IJ (K-l) (E a. x -§) ’ g _1 (E a. x. ~ §) 

. 1 la* * J- • • 

1 1 


IJ (K-l) p 


- IJ(K-l)-p+1 r 1-a,P/IJ(K-l)-p+1’ 


Similarly if Hg is rejected one would be interested to test 

hypothesis of the form H* : E a 3 J = y where the a are reals 

£ . i J 

satisfying E a. = 0 and y is a fixed vector. The optimum 

j J 


invariant test for this problem is to reject H* if 

p 


U 1 IJ(K-1) (E a x - X )' 9 1 (E a. x -y) 
j J .J. j J -J- 


IJ (K-l) p 
- IJ(K-l)-p+1 


1-a,p,IJ(K-l)-p+1 


If is rejected then one would not ordinarily explore it 




sow 


■om I 


A 


further, although theoretically one could use similar invariant 


test procedures as in the case of H* and H* 

a p 


If it is possible an effort should be made to collect equal 
number of observations per cell. The analysis of a two-way 
classification with unequal number of observations per cell is 
more complicated. Nevertheless, exact procedures used in the 
univariate case can be extended to the multivariate case in a 
straightforward manner. 


th 


Roy and Bose (1953) and Roy (1957) have studied the problem 
construction of simultaneous confidence interval associated 





.,ith the maximum chara ct GEN ERAL Li near HYp 

L,ans of mu ltlvar _.*«*«:*, root 375 

T " e snalvsi o _ • lt ©rion of « 

m n Examgie^ 1 of variance DrnKl R ° Y for the 

,Ck V^ 7~' --■* ^Wo~wair P-^oblern 

dlf ferent~l[^7^^ 

^«^9 71 . 1972 in “*« c ^nr^ u ; rabie 7 -> «*«„. 

components: (i) ° ar ' India. Eaoh Y whlc h were raised 

>e ad in centimeters in ce "timeter* e "“ lon vect °r has 6 

(IV) average yield ‘ 17 > average number oi 'a ° f 6ar 

thousand such in gram Plant “ drams, , v) „ S per P^nt, 

column indices run S ' <V1> av ®rage ear ■ ei3ht ° f ° ne 

indices run over fo^. drosses If^r- ^ 

OUr different i bar -ey and 'the r-r, 

NOW u = aet h/a locations. he row 

fc b/det < a +b> . , 1449> 

~{ (1-1)(J_l) I . 

From Table 7 i 2 ^P-I+2) } i oa , _ v -v_ 

Xe 7 * 3 we obtain g u = ^7.60985. 

One percent value of {_ {1 , . , T 

live percent value of {-< 1 - 1 ) ^ U ia “■<«> <«-«5 

thus our data show there 1 ° 9 U is <l-02> (85.96N 

crosses . “ aidnif ioant aifferenoes 

BlocV n„„<„ 

jrfr univar late randomised bk,ck de ~ la2 '’ ^ h3Ve treated 

feapter 5. For the raultivariato “ mplata details in 

i the unrvariate one but each observation herH^ “ f® Same 35 

^observation o^' ^pln^valilt^' ^ j = 7 .- 

Jlochs each containing v treatments. ^‘tL^ “ ^ 
randomized block design the mathematical model is 
r-ij) = v + d 1 + il Cov(X. .) = S (positive definite) 

•here X.J is the observed effect of the jth treatment in the 
*th block, jj = (y , y ) 1 is the grand mean, a 1 = 

a il' ’** ,a ip^ ’ is the effect of tlie ith block such that j a 1 

5 0./ J3 J ' = (3 . 1# . . ., 3^ ) ' is the effect of jth treatment 
Satisfying £ jf 


J-L -i JP 

'^ = 0_ and the . are 


independently normally 
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Table 


Double cross 


of barley 


Locations 


126.60 123.26 

59 . 8 O 46.20 
19.66 21.55 

5.62 4.46 

101.32 91.62 

18.03 20.54 

133.04 133.96 

39.60 52.20 


19.95 

3.34 

77.08 

23.08 


21.24 

4.40 

82.62 

18.78 


l3 7.24 129.26 
46.20 56.20 
19.33 19.J8 
4.44 5.46 

108.72 103.04 
26.74 18* 87 

130.50 126.26 
52.00 39.80 

20.65 21.54 

4.64 3.58 

85.46 76.36 

18.42 22.33 


193 26 140.06 

ll 60 «*•» 

20.33 20i79 
4.62 4.26 

89.18 110 . 1 ^ 
19.30 20.85 

127.58 119.36 
51.20 42.20 

20.43 19.94 

4.52 4.06 

78.74 69.64 

17.42 17.15 


i*" 76 “Ho 

55.80 51.W 

20.65 20 * 38 

5.04 5.08 

91.78 105.32 
18.21 20.72 

128.54 118.42 
45.60 44.40 


20.31 

3.92 

97.40 

24.85 


113.90 

37.60 

125.42 

39.60 

127.96 

42.60 

115 

28 

.82 

.20 

133 

40 

.74 

.60 

121 

52 

.26 

.80 

107 

39 

20.99 

19.40 

20.06 

21 

.64 

19 

.01 

19 

.37 

19 

3.22 

3.56 

3.86 

2 

.46 

3, 

.78 

4, 

.48 

3 

91.96 

72.18 

80.32 

68 , 

.16 

,74. 

.02 

92, 

. 66 

85 

28.56 

20.27 

20.82 

27, 

.70 

19, 

.58 

20 , 

.68 

28 

121.52 

128.06 

129.24 

125. 

,10 

133. 

,82 

130. 

,78 

132 

52.40 

31.60 

48.00 

61. 

,60 

38. 

,80 

49. 

,20 

52 

20.03 

20.65 

19.69 

20 . 

00 

19. 

23 

20 . 

,27 

19, 

5.12 

2.78 

4.36 

5. 

34 

3. 

12 

4. 

28 

4, 

92.46 

77.68 

90.48 

99. 

66 

82. 

96 

96. 

12 

77, 

18.06 

27.94 

20.75 

18. 

66 

26. 

42 

22 . 

46. 

16, 


20.54 

4.28 

78.74 

18.40 


39.20 

20.73 

3.46 

72.28 

20.87 


41.60 

19.52 

3.72 

93.66 

25.18 


129*64 121-40 

48.20 52.00 

19.09 20.28 

4.22 4.96 

109.28 90.22 

25.90 18.19 

128.04 122.48 

49.20 40.80 

19.78 20.52 

4.56 3.84 

86.28 95.32 

18.92 24.82 

116.06 118.32 
48.00 36.20 
19.54 20.32 

4.36 3.28 

96.76 96.16 

22.19 29.32 


The matrix a, b are given by 


/772958.19120 
/ 281956.78700 
122836.93360 
25553.52290 
542196.30440 
\131499.07676 

^774437.42900 

282055.10200 

122862.63635 

25560.87220 

542398.55150 

^ 131452.59180 


44.80 

20.44 
3.94 

92.44 
23.46 


55.80 

20.50 

4.94 

88.42 

17.90 


121.26 134.^ 

43.40 46. 8 /l 
20.60 19.19 

4* 18 3.96 J 

80.26 104,3g i 

19.20 26.35 | 

133.72 121.9 8 
52.60 47.8o i 
20.24 20.1$ I 

4.96 4.12’J 

84.78 68.96 1 

17.09 I6.74 

129.56 127.08 * 

44.40 53.80 

20.34 20.02 

4.18 4.96 ' 

82.74 114.l|| 

19.79 23.01,1 


38.60 

20.46 

3.56 

90.48 

25.42 


.103035.87000 

44814.78400 

9335.66200 

198199.66200 

48016.45650 


104619.95500 

44800.21258 

9488.29383 

198624.29583 

47300.86142 


19540.58862 

4062.63695 846.32530 

17964 * 13 050 38170.87290 

.98705 4351.16635 92390.13750 22418.60^ 


19548.35585 

4060.99354 

86200 Sl»« , 861 - 83f >88 

20919 ! 62470 4277;“° 32 383842 .35742 

994 92690.46031 229 






distributed. The atlal . . A 

L be done in the sam ° f desl . 

, same way as ... esi 3» With fi 

tw o-way classif ication he an aiys 1Xed effect mode 

block-treatment interaction" 6 L 

i- 6 • 5 * hout any 

details the difTe^T 7 ^ a --^lDesi^ l! 


*.,V 




'^AR 


With f «ed eff 
is „< eff ect 


mm 


, n, , Ct toodeffb 

u t;i -variate M 


f 


L i 


details the diffe^T7^ a ~^-£S£iW ! We ha , 

ent aspects ofT^ have Ascribed ; 

Chapter 5. The l=,—. . 1 bivariate l,*,. ed 10 


Chapter 5. The layout 7 ° f Univ «:.ate lati UeSCribsd « 

layout for the mnH-- square desim. • 

the univariate one but tv, ultlva riate case is th _ 11 

e DUt the observa+--f . the same as 

vector (observations on n rpo 13 a P-^nsional 

i i v = i P es P° n ses) x. = ( x 

1 ' J,K 1 . v, where x. . is tv» K ljk UU. x iik J\ 

treatment in the ith row and tv • ° S6rvatlon vector on the kth 

mathematical model of the ! 1 ° 1Umn - 

,i„. bv *• •*“« .™>. -i. „ 


ijk^ — + —' + + t , Cov ^-ijk^ (positive definite) 

- ^ 


—j. j js. — — — —i 9 —ij]^' u \jjujdxu±vc a ermine; 

where the triplets (i,j,k) take on only the set S of values 
indicated by the particular latin square selected for the 
experiment, y = (y^ ...,y )' is the grand mean, ^t 1 = 
in, a ) ' is the ith row effect such that E a ] ‘=0, 6 J = 

(3 , . . ., 3 .) ’ is the ith column effect such that 16=0, 

J ^ ^ rr _i. 4-haf 


and t k = * * *' fc kp 

I t k = 0 . 


j 

) 1 is the kth treatment effect such that 

I 


this design the hypotheses of interest are 

i 0 „ .all =0, V allt - 0. 

; all «. °' 1 " - is a multivariate linear 

r n 1 each hypothesis i 2 

notation of 1-1 = 3v _ 2 , N-s=v -3 v 


the notation 


hypothesis w 

= (v-l) * 


with r 
i . Let 


)-l> s 


= l+3v- 


1 —ijk' 

,k)eS 


L i--” (3.kl aS 


x = E 


- u ~‘ 

Under 11 q 
given by 


3 tiiuate 


of L' 


^.tM ar ' 
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y = 2, 


i __ 


a - 2 


■l. • 


A i v ,t =x i-x , 

-x = £.j. -- •• - 


2: 

v E- 


{(X . —x. -2 j.-S..k 

l '2ijk — x. • •-> * 


+2x ) 


(i,eS 
X (*ijk ' -i- 


- *.j 


- x , + 2x 
.k 


) • } = g (say). 


W < - . 


„ matrix G (with values g) is distributed as W p U, 

The random ma , 1} ( v _2) > p. Under the hypothesis 

(v-iHv-2)) pd estima tes of n,ot ± ,e?.Z are given by 

H the maximum lixexi 



- "i _ x _ x 3 J =x , -2 

y = x > ot - 2j_ _. . — • J • 

““ • • • 


2 V . 
v E - 


(i, j , k) eS 


, v x _x +x ) (x, ..-x, -2 -j +2 ) ' 

—ijk - —i.. “O*. — • * * 1 * °* •** 


=v 2 +Z(x v -x 

k -..k . 


) (2 v"2 ) ' 

. • JV • • • 


, a (7.77) 

= g+c (say). 

The random matrices , G, C (with values c) are independent and 
under H t C is distributed as Wp(£,v-1) provided that v-1 _> p. 
From (7776)-(7.77) the likelihood ratio test of H fc rejects 
whenever 


det g 

U 1 det (g+c) - 


< k 


where the constant k depends on the level of significance a of 


the test and under the statistic (with values u^) is 
distributed as U p , v-T,(v-1)(v-2) 


Similarly the likelihood ratio test of 
whenever 


H rejects H 
a J a 


u = - get 9 ._ „ 
2 det(a+g) — 


< k 


where a Zv ^2^ #> “2 ^ (2^ ~2 ) 1 and k is a constant 

depending on the ievei of significance a of the test and under 

Tt * 1 ... 


the statistic U 2 (with values u ) is distributed 


as U 


p,v-l,(v-l)(v-2)* 

The likelihood ratio test of the hypothesis H Q rejects 
Hg whenever 2 
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r -SSS-a-r < y 

* det (b+g) - 

Where b = .£ v . j ...) ( ^.j.and k is a constant 

^pending on the level of significance a of the test and under 

a the statistic U_ (with values u~) is distributed as 
a ° J 

«p,v-li(v-1)(v- 2 )* 

Other invariant tests of H , H. and H. can be obtained 

a 3 t 

following (7.14)-(7.17). 

If any of the hypotheses/ say, H is rejected then we- need 

i k. 

to test hypothesis of the form H*: I c^t = £ where the c^ are 

—— k 

real constants such that Z c = 0 and £ is a fixed vector. The 

k K 

optimum invariant test of H* rejects H* whenever 

(v-1) (v-2) X (Z c k x <k -£) ' Q 1 (J c k-. .k " 

k k 

> [Vl ) Z-V) -P + I P l-a.p,(v-l><v-2>-P + l' 


where 


= 2 3- . Note that 


(v-1) (v-2)X(Z c k ..k 
k 


_a) 1 G ^ (2 c y. .k — 
■ - k 


is distributed as Hotelling's T 2 -statistic with degrees of 
freedom (v-1) (v-2) and noncentrality parame er 

k \ v" 1 f Z c t — 0 ) • 

•x / V* .m -I— a V L \ u '-v— — 


A( 1 c k t k - I'M 1 c,t - o 

* . f _ or Hq or both should be similarly treated. 

The rejection of 3. 

7.7. MULTIVARIATE ANALYSIS OF COVA of univariate one-way 

We have treated the analysis 4 and those of different 

and two-way classifications m Chapt 
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designs in Chapter 5. for the multivariate analysis of 
covariance the concepts are the same but the observations and 
parameters are vectors. As in the univariate case there is a 
multi.ariate analysis of covariance model for every multivariate 
analyses of variance model. For example the analysis of 
covariance model for the two-way classification without inter¬ 
action can be written as 


E (Xij) = y + a 1 + 3 :3 + Au^j, i = 1,.../I, j 
Cov(X^j) = £ (positive definite) , 
or equivalently 


— 1, • • • , J 


E(X) = Bz+Au = (3, A) (*) , 

Cov(X. .) = £ 

where $,z are as defined in (7.75) and u^ 
a vector of k concomitant variates 




' u ijk> ’ is 


u (ii 2 ^ *•••* Hij f f Hj / • • • / H j j) 

is a k x (ij) matrix and A is a p x k. matrix of unknown 
coefficients which are assumed to be the same for all (i, j) . 

Under this model the design matrix is (^) and (3,A) is the 
parameter matrix. Thus, as in the case of univariate analysis 
of covariance, the multivariate analysis of covaiiance can be 
treated as the multivariate regression model as discussed in 
Section 7.5. The problem of estimation and testing hypotheses 

on (3,A) can be resolved from the methodology developed in 
Section 7.5. 

7.8. CASE OF MULTINOMIAL AND COMPLEX MULTIVARIATE NORMAL 
DISTRIBUTIONS 11AAi 

Multivariate analysis of variance of one-way, two-way and higher¬ 
way classifications with discrete data following multinomial 

distributions has been treated by Roy (1957) , Lehmann (1959) , 
and Rao (1965) , 

We shall give some recent developments regarding the 

complex multivariate aen^rai , 

general linear hypothesis which is defined 




for the complex multivariate ^ HYP0THES 'S 38. 

as the multivariate li„ ear hy^. di * tribUti ° ns ln the -V 
distribution. The test statistic ^ multivar ^te normal 

on the characteristic roots of t 8PendS/ as in the real case, 
matrices. The distribution * independent complex Wishart 

characteristic roots of statistics based on the 

oots of complex Wish^rt- 4- 

useful in multiple time series anal - , matriCei ls also 

noncentral distribution t ySls (Hannan 1970). The joint 

Kishart matri " as ^ —ic roots of complex 

linear hypotheLs mlT ^ “* l “ a -neral 

terms of Zonal 1 WSre 91Ven explicitl y b ? Ja "®s <1964) in 

e Of Zonal polynomials and by Khatri (1964a) in the form of 

in egra s. In the case of central complex Kishart matrices and 

complex matrices connected with the complex multivariate general 

linear hypothesis, the distribution of extreme characteristic 

roots were obtained by Pillai and Young (1970) and Pillai and 

uouris (1972). Khatri (1946b,1970) obtained the noncentral 

distribution of the individual characteristic roots of the 

complex matrices associated with the complex general linear 

hypothesis and that of traces. 

7.9. STATISTICAL TABLES ON WILKS' CRITERION U=U 

p, r, N-s 

In this section we shall reproduce, with the kind permission of 
the Editor, Biometrika, the tables of Schatzoff (1966) and of 
Pillai and Gupta (1969) for the chisquare adjustment factor 

[ percentile for {- (N-s-^ (p-r+1) ) log e U) ] 

C = -- — 

percentile for x r 

2 

for converting the upper percentile points of X pr to get the 
exact percentile points of {-(N-s-’-i(p-r+1) log e U} (lower 
percentile point of U) for different values of P,r,M = N-s-p+1. 

In other words to obtain the required percentile point 
-< N -s-Mp-r + l»log Owe multiply the corresponding upper 

e 2 b t h e tabulated value of the adjustment 
percentile point o X Schat2off (1966) and Table 7.3 is 

factor C. Table 7.2 is aue 

due to Pillai and Gupta (1969) . 
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Table 7.2 


, hl _ squ are adjustments to Wilks's criterion D. Pact 
Tables of cm h s of U (upper percentiles of / \ 

for lower percentiles 




1-000 

1-000 

1-000 

1-000 1-000 

23-5418 

26-2962 

28-8454 

31-9999 34-2672 



6, r = 4 



i-OOO 

33-1903 36-4151 39-3041 


m 
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Table 7.2 (cont.) 


n 

ix 

o-ioo 

P = 7 . 

0-050 

r = 4 

0-025 

1 ' i 

2 

H 4 

1-497 

1 - 55 ^ 

1-601 

1-244 

1-263 

1-281 

1-155 

1-165 

1-175 

0 

A 

1-109 

1-116 

1-122 

6 

1-082 

1-087 

1-092 

6 

1-064 

1-068 

1-071 

7 

i -052 

1-055 

1-057 

8 

1-043 

1-045 

1-047 

9 

1-036 

1-033 

1-040 

1 10 

1-031 

1-032 

1-034 


0-010 0-005 


12 

1-023 

1-024 

14 

1-018 

1-019 

16 

1-015 

1-015 

18 

1-012 

1-013 

20 

1*010 

1-011 

24 

1-007 

1-008 

30 

1-005 

1-005 

40 

1-003 

1-003 

60 

1-001 

1-001 

120 

1-000 

1-000 

00 

1-000 

1-000 

2 

v 

37-9159 

41-3372 


1-026 

1-020 

1-016 

1-013 

1-011 

1-008 

1-005 

1-003 

1-002 

1-000 

1000 

44-4607 


1-667 

1-305 

1-188 

1-130 

1-007 

1-076 

1-061 

1-050 

1-042 

1-036 

1-027 

1-021 

1-017 

1-014 

1-012 

1-008 

1-006 

1-003 

1-002 

1-000 


1-715 

1-322 

1-197 

1-136 

1-101 

1079 

1-063 

1-052 

1-044 

1-037 

1-028 

1-022 

1-017 

1-014 

1-012 

1-009 

1-006 

1-004 

1-002 

1-000 


1-000 1-000 
48-2782 50-9933 


M 

1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

12 

14 

16 

18 

20 

24 

30 

40 

60 


.a 0-100 

1-657 
1-288 
1-189 
1-137 
1-105 

1-083 
1-068 
1-057 
1-048 
1-041 

1-032 
J -025 
1-020 
1-017 
1-014 

1-010 
1-007 
1-004 
1-002 
1-001 


00 

4 


1 000 
47*2122 


p = 9 , 
0-050 

1-614 

1-309 

1-201 

1-144 

1-110 

1-088 

1-071 

1-060 

1-050 

1-043 

1-033 

1-026 

1-021 

1-018 

1-015 

1-011 

1-007 

1-004 

1-002 

1-001 

1-000 

50-9985 


r = 4 


0-025 

0010 

1-669 

1-740 

1-329 

1-355 

1-212 

1-223 

1-152 

1-161 

1-115 

1-122 

1*091 

1-096 

1-075 

1-078 

1062 

1-065 

1-053 

1-055 

1-046 

1-047 

1-034 

1-036 

1-027 

1-029 

1-022 

1-023 

1-018 

1*019 

1-015 

1-016 

1-011 

1-012 

1-008 

1-008 

1-005 

1-005 

1-002 

1-002 

1001 

1-001 

1-000 

1-000 

54*4373 

58*6192 


0-100 

1-528 

1-266 

1-172 

1-123 

1-093 

1-074 

1-060 

1-050 

1-042 

1-036 

1-027 

1-021 

1-017 

1-014 

1-012 

1-009 

1-006 

1-003 

1-002 

1-000 

1-000 

42-5847 


P = 8 ,r = 4 
0-050 0-025 0-010 


1-583 

1-280 

1-183 

1-130 

1-099 

1-078 

1-063 

1-052 

1-044 

1-038 

1-029 
1-023 
1-018 
1-015 
1 013 

1-009 

1-006 

1-004 

1-002 

1-000 

1-000 

46-1943 


l - r 36 

1-305 

1193 

1-137 

1-103 

1-081 

1-066 

1-055 

1-046 

1-039 

1-030 

1-023 

1-019 

1-016 

1-013 

1-010 

1-007 

1-004 

1-002 

1-000 

1-000 

49-4804 


1-704 

1-330 

1-207 

1-146 

1-109 

1-086 

1-070 

1-058 

1-048 

1-041 

1-031 

1-025 

1-020 

1-016 

1-014 

1-010 

1-007 

1-004 

1-002 

1-001 


0-005 

1-764 

1-348 

1-216 

1-162 

1-114 

1-089 

1-072 

1-060 

1-050 

1-043 

1-033 

1-026 

1-021 

1-017 

1-014 

1-010 

1-007 

1-004 

1-002 

1-001 


1-000 1-000 
53-4858 56-3281 


0-005 0-100 


p = 10 , r = 4 
0-060 


1-792 
1-373 
1-236 
1-167 
1-127 

1-100 
1-081 
1-068 
1-057 
1-049 

1-037 

1-029 

1-024 

1-020 

1-017 

1-012 

1-008 

1-005 

1-002 

1*901 

1-000 

61-5812 


1-585 

1-309 

1-206 

1-150 

1-116 

1-093 

1-076 

1-064 

1-054 

1-047 

1-036 

1-029 

1-023 

1-019 

1-016 

1-012 

1-008 

1-005 

1-002 

1-001 


1-644 

1-331 

1-218 

1-159 

1-122 

1-097 

1-080 

1-067 

1-057 

1-049 

1-038 

1-030 

1-024 

1-020 

1-017 

1-013 

1-009 

1-005 

1-003 

1-001 


0-025 

0-010 

0-005 

1-701 

1-774 

1-828 

1-352 

1-379 

1-398 

1-230 

1-244 

1-255 

1-166 

1-176 

1-183 

1-128 

1-134 

1-139 

1-102 

1-107 

1-111 


1-083 

1-070 

1-059 

1-051 


1-088 

1-073 

1-062 

1-054 


1-000 i° 00 


1-039 

1-041 

1-031 

1-033 

1-025 

1-026 

1-021 

1-022 

1-018 

1-019 

1*013 

1-014 

1-009 

1-009 

1-005 

1-006 

1-003 

1-003 

1-001 

1*001 

1-000 

1-000 

69*3417 

63-6907 


1-076 

1-064 

1-055 

1-042 

1-034 

1-027 

1-023 

1-019 

1-014 

1-010 

1-006 

1-003 

1-001 

1-000 





analysis of variance 


Table 7.2 (cont.) 


.X 


M 

1 

O 

t* 

3 

4 

5 

0 

7 

8 
n 

10 

12 

14 

10 

18 

20 

24 

30 

40 

60 

120 

oo 

2 


pr 



o-ioo 

0-060 

1-482 

1-635 

1-222 

1-241 

1-136 

1-145 

1-092 

1-099 

1-068 

1-072 

1-062 

1-056 

1-041 

1-044 

1-034 

1-030 

1-028 

1-030 

1 024 

1-025 

1-018 

1-019 

1-014 

1-014 

1-011 

1-012 

1-009 

1-009 

1-007 

1-008 

1-006 

1-006 

1-004 

1-004 

1-002 

1-002 

1-001 

1-001 

1-000 

1-000 

1-000 

1-000 

26-9894 

28-8093 


= 3 . r = 6 
0026 


1-680 

1-269 

1-166 

1-106 

1-077 

1-069 

1-047 

1-038 

1-032 

1-027 

1-020 
1016 
1-012 
1-010 
1-008 

1-006 

1-004 

1-002 

1-001 

1-000 

1-000 


o-oio 

1-649 

1-282 

1-107 

1-113 

1-082 

1 - 0 G 3 

1-060 

1-041 

1-034 

1-028 

1-021 

1-010 

1-013 

1-011 

1-009 

1-000 

1-004 

1-002 

1-001 

1-000 


0-006 

i -694 

1-298 

1-170 

1-119 

1-080 

1-000 

1-062 

1-042 

1-036 

1-030 

1-022 

1-017 

1-014 

1-011 

1-009 

1-007 

1-004 

1-003 

1-001 

1-000 


p s 4 , r = 0 

The values for G ore obtained by entering 
the table f° r P = ^ > r “ ^ * 
with M =N-s-3 


1-000 1-000 
34-8063 37 - 1664 ! 




V = 6 , 

r = 6 




V ~ 

0 , r = 6 



,X 

0-100 

0-060 

0-025 

0-010 

0005 

0-100 

0-050 

0-025 

0-010 

0-005 

1 

1-465 

1-614 

1-563 

1-625 

1-671 

1-471 

1-620 

1-568 

1-631 

1-677 

2 

1-228 

1-245 

1-262 

1-284 

1-300 

1-237 

1-255 

1-272 

1-294 

1-310 

3 

1-144 

1-154 

1-163 

1-175 

1-183 

1-153 

1-103 

1-172 

1-183 

1-192 

4 

1-102 

1-108 

1-114 

1-121 

1-127 

1-109 

1-116 

1-122 

1-129 

1-134 

5 

1-077 

1-081 

1-085 

1-090 

1-094 

1-083 

1-088 

1-092 

1-097 

1-101 

6 

1-000 

1-003 

1 - 06 C 

1-070 

1-073 

1 - 06 G 

1-069 

1-072 

1-076 

1-079 

7 

1-048 

1-051 

1-053 

1-056 

1-059 

1-053 

1-056 

1-058 

1-061 

1-064 

8 

1-040 

1 042 

1-044 

1-046 

1-048 

1-044 

1-046 

1-048 

1-051 

1-053 

9 

1-034 

1-035 

1-037 

1-039 

1-040 

1-037 

1-039 

1-041 

1-043 

1-044 

10 

1-029 

1-030 

1-031 

1-033 

1-034 

1-032 

1-034 

1-035 

1-037 

1-038 

12 

1-022 

1-023 

1-024 

1-025 

1-026 

1-024 

1-025 

1-026 

1-028 

1-029 

14 

1-017 

1-018 

1-019 

1-019 

1-020 

1-019 

1-020 

1-021 

1-022 

1-022 

16 

1-014 

1-014 

1-016 

1-016 

1-016 

1-015 

1-016 

1-017 

1-018 

1-018 

18 

1-011 

1-012 

1-012 

1-013 

1-013 

1-013 

1-013 

1-014 

1-014 

1-015 

20 

1-009 

1-010 

1-010 

1-011 

1-011 

1-011 

1-011 

1-012 

1-012 

1-013 

24 

1-007 

1-007 

1-007 

1-008 

1-008 

1-008 

1-008 

1-009 

1-009 

1-009 

30 

1-005 

1-005 

1-005 

1-005 

1-005 

1-005 

1-006 

1-006 

1-006 

1 - 00 G 

40 

1-003 

1-003 

1-003 

1-003 

1-003 

1-003 

1-003 

1-003 

1-004 

1-004 

60 

1-001 

1-001 

1-001 

1-001 

1-002 

1-002 

1-002 

1-002 

1-002 

1-002 

120 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

00 

„2 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

V 

40-2560 

43-7730 

46-9702 

60-8922 

63-6720 

1 47-2122 

60-0985 

54-4373 

58-6192 

61-5812 




12 

14 

16 

18 

20 


24 

30 

40 

GO 

120 


00 

2 

X 


P r 
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Table 7.2. (cont.) 


0100 


p - 7 , r = o 
0-050 0025 


0010 0-005 0-100 


p = 8, r = 6 

0-050 0-025 0-010 


0-005 


1-481 

1-249 

1-163 

1*118 

1-090 


1-530 

1-266 

1-173 

1-124 

1-095 


1-579 

1-284 

1-182 

1-131 

1-099 


1-642 

1-306 

1-194 

1-138 

1-105 


1-C88 

1-322 

1-203 

1-144 

1-109 


1-494 

1-261 

1-174 

1-127 

1-098 


1-543 

1-279 

1-184 

1-134 

1-103 


1-593 

1-297 

1-194 

1-140 

1-108 


1-650 

1-319 

1-205 

1-148 

1-113 


1-703 

1-336 

1-214 

1-153 

1-117 


1-072 

1-059 

1-049 

1-042 

1-036 


1-075 

1-062 

1-051 

1-043 

1-037 


1-079 

1-064 

1-053 

1-045 

1-039 


1-083 

1-067 

1 - 05 G 

1-047 

1-041 


1-086 

1-070 

1-058 

1-049 

1-042 


1-079 

1-065 

1-054 

1-046 

1-040 


1-082 

1-068 

1-057 

1-048 

1-042 


1-086 

1-070 

J -059 


1-060 

1-043 


1-090 

1-074 

1-062 

1-052 

1-045 


1-093 

1-076 

1-063 

1-054 

1-046 


1-027 

1-022 

1-018 

1-014 

1-012 


1-029 

1-023 

1-018 

1-015 

1-013 


1-030 

1-023 

1-019 

1-016 

1-013 


1-031 

1-024 

1-020 

1-01G 

1-014 


1-032 
1-025 
1-020 
1 017 
1-014 


031 

024 

020 

016 

014 


1-032 

1-025 

1-021 

1-017 

1-014 


1-033 

1-026 

1-021 

1-018 

1-015 


1-035 

1-027 

1-022 

1-018 

1-015 


1-036 

1-028 

1-022 

1-019 

1-016 


1-009 

1-006 

1-004 

1-002 

1-000 


1-009 

1-006 

1-004 

1-002 

1-000 


1-010 

1-007 

1-004 

1-002 

1-001 


1-010 

1-007 

1-004 

1-002 

1-001 


1-010 

1-007 

1-004 

1-002 

1-001 


1-010 

1-007 

1-004 

1-002 

1-001 


1-011 

1-007 

1-004 

1-002 

1-001 


1-011 

1-008 

1-005 

1-002 

1-001 


1-011 

1-008 

1-006 

J-002 

1-001 


1-012 

1-008 

1-005 

1-002 

1-001 


1-000 

54-0902 


1-000 

68-1240 


1-000 

61-7768 


1-000 

66-2062 


1-000 
69-3360 j 


1-000 

60-9066 


1-000 

65-1708 


1-000 

69-0226 


1-000 

73-6826 


1-000 

76-9688 


p = 9 , r = 6 


\oc 

0-100 

0-050 

0-025 

0-010 

M \ 

1 

1-508 

1-558 

1-607 

1-671 

2 

1-275 

1-293 

1-311 

1-333 

3 

1-185 

1-196 

1-205 

1-218 

4 

1-137 

1-144 

1-150 

1-158 

5 

1-107 

1-112 

1-116 

1-122 

6 

1-086 

1-090 

1-093 

1-098 

7 

1-071 

1-074 

1-077 

1-080 

8 

1-060 

1-062 

1-065 

1-967 

9 

1-051 

1-053 

1-055 

1-058 

10 

1-044 

1-046 

1-048 

1-050 

12 

1-034 

1-035 

1-036 

1-037 

14 

1-027 

1-028 

1-029 

1-031 

16 

1-022 

1-023 

1-024 

1-024 

18 

1-019 

1-019 

1-020 

1-020 

20 

1-016 

■Vi. 

1-016 

1-017 

1-017 

24 

1-012 

1-012 

1-013 

1-013 

30 

1-008 

1-008 

1-009 

1-009 

40 

1-005 

1-005 

1-005 

1-005 

60 

1-002 

1-002 

1-003 

1-003 

120 

1-001 

1-001 

1-001 

1-001 

00 

1-000 

1-000 

1-000 

1-000 

v 2 

67-6728 

72-1532 

76-1921 

81-0688 


0-005 


1-719 

1-350 

1-227 

1-164 

1-126 


1-101 

1-083 

1-069 

1-059 

1-051 



P - 

10 , r = 

6 


0100 

0-050 

0-025 

0-010 

0-005 

1-623 

1-573 

1-623 

1-687 

1-736 

1-288 

1-307 

1-325 

1-348 

1-365 

1-197 

1-208 

1-218 

1-230 

1-239 

1-147 

1-164 

1-161 

1-169 

1-176 

1-115 

1-120 

1-125 

1-131 

1-135 

1-093 

1-097 

1-101 

1-106 

1-109 

1-078 

1-081 

1-084 

1-087 

1-090 

1-066 

1-068 

1-071 

1-074 

1-076 

1 - 05 C 

1-059 

1-061 

1-063 

1-065 

1-049 

1-051 

1-053 

1-055 

1-056 


1-037 

1-031 

1-025 

1-021 

1-018 


1-038 

1-031 

1-025 

1*021 

1-018 


1-013 

1-009 

1-006 

1-003 

1-002 


1-013 

1-009 

1-006 

1-003 

1-001 


1-000 


1-000 

74-3970 


1-040 

1-041 

1-042 

1-043 

1-032 

1-033 

1-034 

1-034 

1-026 

1-026 

1-027 

1-028 

1-021 

1-022 

1-023 

1-023 

1-018 

1-019 

1-019 

1-020 

1-014 

1014 

1-015 

1-015 

1010 

1-010 

1-010 

1-010 

1-006 

1-006 

1-006 

1-006 

1-003 

1-003 

1-003 

1-003 

1-001 

1-001 

1-001 

1-001 

1-000 

1-000 

1-000 

1-000 

79-0819 

83-2976 

88-3794 

91-951 




pr 







P = 7 , 

\a 

0-100 

0-050 

J»/\ 



1 

1-490 

1-538 

2 

1-205 

1-282 

3 

1-179 

1-189 

4 

1-132 

1-139 

5 

1-103 

1-108 

6 

1-083 

i -086 

7 

1-068 

1-071 

8 

1-058 

1-060 

9 

1-049 

1-051 

10 

1-043 

1-044 

12 

1033 

1-034 

14 

1-026 

1-027 

16 

1-021 

1-022 

18 

1-018 

1-018 

20 

1-015 

1-016 

24 

1-011 

1-012 

30 

1-008 

1-008 

40 

1-005 

1-005 

60 

120 

1-002 

1-001 

1-002 

1-001 

00 

1-000 

1-000 

X Pr 

69-9186 

74-4683 


1000 

78-6072 


1-037 

1-028 

1-023 

1-020 

1-017 

1-012 

1-009 

1-006 

1003 

1-001 

1-000 

83-6134 


1-000 

80-0938 


1000 

78-8596 


1-000 1-000 
83-6753 88-0041 


1-000 

93-2169 


1-000 

90-8781 
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Table 7 . 2 . ( cont .) 


P = 3 t r - 10 
0060 0-026 


0-010 0-006 0-100 


P ~ B,r = 10 
0-060 0-026 0-010 


1-710 

1-369 

1-232 

1-167 


1-781 

1-383 

1-246 

1-176 


1-862 

1-413 

1-262 

1-187 


W 6 

1-120 

1-127 

1-133 

1-141 

If 6 

1.095 

1.101 

1.106 

1.112 

F . 7 

1.078 

1.082 

1.086 

1.091 

1 ' g 

1.065 

1.068 

1.072 

1.075 

r,. ‘9 

1.066 

1.058 

1.061 

1.064 

10 

1.047 

1.050 

1.051 

1.055 

12 

1-030 

1038 

1-040 

1-042 

14 

1-029 

1-030 

1031 

1-033 

: 10 

1-023 

1024 

1-025 

1-027 

18 

1-019 

1-020 

1-021 

1-022 

20 

1-016 

1-017 

1-018 

1-019 

24 

1-012 

1-012 

1-013 

1-014 

30 

1-008 

1-009 

1-009 

1-009 

40 

1-005 

1-005 

1-005 

1-006 

60 

1-002 

1-002 

1-003 

1-003 

120 

1-001 

1-001 

1-001 

1-001 

CO 

1-000 

1000 

1-000 

1-000 

x 2 r 

40-2560 

43-7730 

46-9792 

60-8922 

A pr 







P = T 

r = 10 


\« 

0-100 

0-050 

0025 

0-010 

M \ 





1 

1-509 

1-557 

1-604 

1-666 

2 

1-285 

1-303 

1-320 

1-342 

3 

1-197 

1-208 

1-217 

1-229 

4 

1-148 

1-155 

1-162 

1-169 

5 

1-117 

1-122 

1-127 

1-132 


1-921 

1-436 

1-274 

1-196 

1-147 

1.116 

1.095 

1.078 

1.066 

1.067 

1-043 

1-034 

1-028 

1-023 

1-019 

1-014 

1-010 

1-006 

1-003 

1-001 

1-000 


1-647 

1-296 

1-199 

1-147 

1-116 

1.092 

1.076 

1.064 

1.055 

1.048 

1-037 

1-029 

1-024 

1-020 

1-017 

1-013 

1-009 

1-006 

1-003 

1-001 

1-000 

63-1671 



6 1-096 

7 1-080 

8 1-068 

9 1-058 

10 1051 

12 1040 

14 1032 

16 1026 

18 1-022 

20 1-019 

24 1-014 

30 1-010 

40 1-006 

60 1-003 

20 1-001 

<*> 1-000 

2 

Xp r 88-6271 


1-099 

1-083 

1-070 

1-060 

1-053 

1-042 

1-034 

1-028 

1-023 

1-019 

1-014 

1-010 

1-006 

1-003 

1-001 

1 000 

90-6312 


1 

1 

1 

1 

1 

1 - 

1 - 

1 - 

1 

1 - 

1 

1 

1 

1 

1 - 

1 

06 


103 

086 

073 

002 

■054 

■042 

034 

■028 

•023 

■020 

•016 

■010 

•006 

■003 

■001 

■000 


1-107 

1-089 

1-075 

1-065 

1-056 

1-044 

1-036 

1-020 

1-024 

1-020 

1-016 

1-011 

1-007 

1-003 

1-001 


1-600 

1-315 

1-211 

1-165 

1-120 

1.097 

1.080 

1.067 

1.057 

1.050 

1-038 

1-031 

1-025 

1-021 

1-018 

1-013 

1-009 

1-005 

1-003 

1-001 

1-000 

67-6048 


1-653 

1-334 

1-221 

1-102 

1-125 

1.101 

1.083 

1.070 

1.059 

1.051 

1 -OJO 

1-032 

1-020 

1-022 

1-018 


1-721 

1-359 

1-236 

1-171 

1-131 

1.105 

1.087 

1.073 

1.0 

1.0 

HU l 
1-033 
1-027 
1-022 
1-019 


0-005 

1-772 

1-377 

1-244 

1-177 

1-130 

1.109 

1.089 

1.076 


1-013 

1-034 

1-028 

1-023 

1-019 


1-014 1-014 1-014 

1-009 1-010 1-010 

1-006 1 006 1-006 

1-003 1-003 1-003 

1-001 1-002 1-002 

1-000 1-000 1-000 

71-4202 76-1539 79-4900 


0-005 

1-713 

1-359 

1-238 

1-175 

1-136 

1-110 

1-091 

1-077 

1-066 

1-058 

1-045 

1-036 

1-029 

1-024 

1-021 

1-016 

1-011 

1-007 

1-003 

1-001 


For p = 4 , q =10 enter the table 
for p = 10 , r =4 with M = N - s -3 
For p = 6 , r =10 enter the table 
for p = 10 , r =6 with M = N - s -5 


1-000 1 000 


Source 


0231100-4260104-2160 

: M.Schatzoff (1966), Biometrika, PP- 347 '^ 
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Chi-square adjustments to W 
lower percentiles of 

p = 3 , r =12 


A---- 


\ 

x 0100 

0-050 

0-025 

0-010 

0-006 

M 

\ 





1 

1-718 

1-791 

1-860 

1-949 

2-013 

2 

1-382 

1-410 

1-437 

1-470 

1-495 

3 

1-256 

1-272 

1-287 

1-306 

1-319 

4 

1-188 

1-199 

1-209 

1-221 

1-230 

5 

1-146 

1-154 

1-161 

1-170 

1-176 

6 

1-117 

1-123 

1-129 

1-136 

1-141 

7 

1-097 

1-101 

1-106 

1-111 

1-115 

8 

1-081 

1 - 0 S 5 

1-089 

1-093 

1-097 

9 

1-069 

1-073 

1-076 

1-080 

1-082 

10 

1-060 

1-063 

1-066 

1-069 

1-071 

12 

1-046 

1-048 

1-050 

1-053 

1-054 

14 

1-037 

1-039 

1'040 

1-042 

1-043 

16 

1-030 

’■032 

1-033 

1-034 

1-035 

18 

1-025 

1-026 

1-027 

1-029 

1-029 

20 

1-021 

1-022 

1-023 

1-024 

1-025 

24 

1-016 

1-017 

1-017 

1-018 

1-019 

30 

1-011 

1-011 

1-012 

1-012 

1-013 

40 

1-007 

1-007 

1-007 

1-008 

1-008 

60 

1-003 

1-003 

1-004 

1-004 

1-004 

120 

1-001 

1-001 

1-001 

1-001 

1-001 

D 

1-000 

1-000 

1-000 

1-000 

1-000 

Xpr 472122 

50-9985 

64-4373 

58-6192 

61-5812 




r=16 




r 


-*- 




\ 

M 

a 0-100 

\ 

0-050 

0-025 

0-010 

0-005 

1 

1-835 

1-916 

1-995 

2-095 

2-169 

2 

1-469 

1-501 

1-532 

1-571 

1-599 

3 

1-325 

1-344 

1-362 

1-384 

1-400 

4 

1-245 

1-258 

1-271 

1-285 

1-296 

5 

1-195 

1-204 

1-213 

1-224 

1-232 

6 

1-159 

1-167 

1-174 

1-182 

1-188 

7 

1-133 

1-139 

1-145 

1-152 

1-157 

8 

1-114 

1-119 

1-123 

1-129 

1-133 

9 

1098 

1-102 

1-106 

Mil 

1-116 

10 

1-085 

1-089 

1-092 

1-097 

1-099 

12 

1-067 

1-070 

1-073 

1-070 

1-078 

14 

1-054 

1-057 

1-059 

1-061 

1-063 

10 

1-045 

1-047 

1-049 

1-051 

1-052 

18 

1-038 

1-039 

1-041 

1-043 

1-044 

20 

1-032 

1-034 

1-035 

1-036 

1-037 

24 

1-025 

1-026 

1-026 

1-027 

1-028 

30 

1-017 

1-018 

1-018 

1-019 

1-020 

40 

1011 

1-011 

1-011 

1-012 

1-012 

60 

120 

1-005 

1-002 

1-006 

1-002 

1-006 

1-002 

1-006 

1-002 

1-006 

1-002 

W 

2 

1-000 

1-000 

1-000 

1-000 

1-000 

Y 1 

*pr 

00-9066 

65-1708 

69-0226 

73-6826 

76-9688 


lks’s criterion U . Factor C for , 
U (upper percentiles of Xp r ) 


p=3,r=14 


0-100 

0-050 

0-025 

0-010 

0-005 

1-780 

1-857 

1-931 

2-026 

2-095 

1-427 

1-458 

1-486 

1-523 

1-549 

1-292 

1-309 

1-326 

, 1-346 

1-361 

1-217 

1-229 

1-240 

1-254 

1-264 

1-171 

1-179 

1-188 

1-198 

1-205 

1-138 

1-145 

1-152 

1-159 

1-165 

1-115 

1-121 

1-126 

1-132 

1-136 

1-097 

1-102 

1-106 

1-111 

1-115 

1-084 

1-088 

1-091 

1-095 

1-099 

1-073 

1-076 

1-079 

1-082 

1-085 

1-057 

1-059 

1-061 

1-064 

1-066 

1-046 

1-048 

1-049 

1-052 

1-053 

1-037 

1-039 

1-041 

1-042 

1-044 

1-031 

1-033 

1-034 

1-035 

1-036 

1-027 

1-028 

i -029 

1-030 

1-031 

1-020 

1-021 

1-022 

1-023 

1-023 

1-014 

1-015 

1 - 01.5 

1-016 

1-016 

1-009 

1-009 

1-009 

1-010 

1-010 

1-004 

1-004 

1-005 

1-005 

1-005 

1-001 

1-001 

1-001 

1-001 

1-001 

1-000 

1-000 

1-000 

1-000 

1-000 

54-0902 

58-1240 

61-7768 

66-2062 

69-3360 



r =18 

A 



f 

0-100 

0-050 

0-025 

0-010 

0-005 

1-886 

1-971 

2-053 

2-158 

2-235 

1-508 

1-542 

1-575 

1-616 

1-646 

1-357 

1-377 

1-396 

1-420 

1-437 

1-272 

1-286 

1-299 

1-315 

1-327 

1-218 

1-228 

1-238 

1-249 

1-258 

1-179 

1-188 

1-195 

1-204 

1-211 

1-151 

1-158 

1-164 

1-171 

1-177 

1-129 

1-135 

1-140 

1-146 

1-151 

1-112 

1-117 

1-121 

1-127 

1-130 

1-099 

1-103 

1107 

1-111 

1-114 

1-078 

1-081 

1-084 

1-087 

1-090 

1 - 0 C 4 

1-066 

1-068 

1-071 

1-073 

1-053 

1-055 

1-057 

1-059 

1-061 

1-045 

1-046 

1-048 

1-050 

1-051 

1-038 

1-040 

1-041 

1-043 

1-044 

1-029 

1-030 

1-031 

1-032 

1-033 

1-021 

1-021 

1-022 

1-023 

1-023 

1-013 

1-013 

1-014 

1-014 

1-015 

1-006 

1-007 

1-007 

1-007 

1-007 

1-002 

1-002 

1-002 

1-002 

1-002 

1-000 

1-000 

1-000 

1-000 

1-000 

67-6728 

72-1632 

76-1920 

81-0688 

84-5019 
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Table 7.3. (cont.) 


p=3,r=20 

__ A _ 


P=3,r=22 

■ Afc i 


\a 

A/\ 

0-100 

0'050 

0-025 

0-010 

0-005 

0-100 

0-050 

0-025 

0-010 

1 

1-932 

2-021 

2-106 

2-216 

2-297 

1-975 

2-067 

2-156 

2-269 

O 

1-644 

1-580 

1-614 

1-057 

1-689 

1-678 

1-616 

1-651 

1-696 

3 

1-387 

1-408 

1-428 

1-453 

1-472 

1-415 

1-438 

1-459 

1-485 

4 

1-208 

1-313 

1-327 

1-344 

1-356 

1-322 

1-338 

1-353 

1-371 

5 

1-240 

1-251 

1-261 

1-274 

1-283 

1-261 

1-273 

1-284 

1-297 

6 

1-199 

1-208 

1-216 

1-226 

1-233 

1-218 

1-227 

1-236 

1-246 

7 

1-168 

1-176 

1-182 

1-190 

1-196 

1-185 

1-193 

1-200 

1 - 2 Q 9 

8 

1-145 

1-151 

1-157 

1-163 

1-168 

1-160 

1-167 

1-173 

1-180 

9 

1-127 

1-132 

1-136 

1-142 

1-146 

1-142 

1-147 

1-151 

1-157 

10 

1-112 

1-116 

1-120 

1-125 

1-128 

1-124 

1-129 

1-133 

1-139 

12 

1-089 

1-092 

1-095 

1-099 

1-102 

1-099 

1-103 

1-106 

1-110 

14 

1-073 

1-075 

1-078 

1-081 

1-083 

1-082 

1-085 

1-087 

1-091 

in 

1-061 

1-063 

1-065 

1-067 

1-069 

1-069 

1-071 

1-073 

1-076 

X VI 

1 ft 

1-052 

1-053 

1-056 

1-057 

1-059 

1-059 

1-061 

1-063 

1-065 

20 

1-044 

1-046 

1-048 

1-049 

1-050 

1-051 

1-052 

1-054 

1-056 

24 

30 

40 

60 

120 

OO 

1-034 

1-024 

1-015 

1-008 

1-002 

1-000 

1-036 

1-025 

1-016 

1-008 

1-002 

1-000 

1-036 

1-026 

1-016 

1-008 

1-002 

1-000 

1-038 

1-027 

1-017 

1-009 

1-002 

1-000 

1-039 

1-027 

1-017 

1-009 

1-003 

1-000 

1-039 

1-028 

1-018 

1-009 

1-003 

1-000 

1-040 

1-029 

1-018 

1-009 

1-003 

1-000 

1-041 

1-030 

1-019 

1-010 

1-003 

1-000 

1-043 

1-031 

1-020 

1-010 

1-003 

1-000 

w 

2 

V 

74-3970 

790819 

83-2976 

88-3794 

91-9617 

81-0865 

85-9649 

90-3489 

96-6267 

A pr 










0005 


2-353 

1-729 

1-604 

1-384 

1-307 

1-264 

1-215 

1-186 

1-161 

1-141 

1-113 

1-093 

1-078 

1-066 

1-057 

1-044 

1-031 

1-020 

1-010 

1-003 

1-000 

99-3304 
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Table 7.3 (cont.) 





p =4, r = 

-A- 

=12 



P : 

=4,r=14 


\<x 0100 

0-050 

0-025 

0-010 

0-005 

0-100 

0-050 

0-026 

0-010 

jU \ 

1 1-638 

1-700 

1-700 

1-838 

1-895 

1-686 

1-751 

1-814 

1-896 

2 

1-350 

1-373 

1-396 

1-424 

1-446 

1-388 

1-413 

1-436 

1-467 

3 

1-238 

1-252 

1-264 

1-280 

1-292 

1-269 

1-284 

1-297 

1-314 

4 

1-177 

1-186 

1-195 

1-205 

1-213 

1-203 

1-213 

1-222 

1-234 

6 

1-139 

1-145 

1-152 

1-169 

1-165 

1-161 

1-168 

1-175 

1-183 

0 

1-112 

1-118 

1-122 

1-128 

1-132 

1-131 

1137 

1-142 

1-149 

7 

1-093 

1097 

1101 

1-106 

1-109 

1-110 

1-115 

1-119 

1-124 

8 

1079 

1-0 S 2 

1-085 

1-089 

1-092 

1-094 

1-097 

1-101 

1-105 

9 

1-068 

1-070 

1-073 

1-076 

1-079 

1-081 

1-084 

1-087 

1-091 

10 

1059 

1-0 G 1 

1-003 

1-066 

1-068 

1-071 

1073 

1-076 

1-079 

12 

1-046 

1-047 

1-049 

1-051 

1-053 

1-055 

1-058 

1-059 

1-062 

14 

1037 

1 038 

1-039 

1-041 

1-042 

1-046 

1-046 

1-048 

1-050 

16 

1030 

1-031 

1-032 

1-033 

1-034 

1-037 

1-038 

1-039 

1-041 

18 

1-025 

1-026 

1-027 

1-028 

1-029 

1031 

1-032 

1-033 

1-034 

20 

1-021 

1-022 

1-023 

1024 

1-024 

1-026 

1-027 

1-028 

1-029 

24 

1-016 

1017 

1-017 

.1-018 

1-018 

1-020 

1-021 

1-021 

1-022 

30 

1-011 

1011 

1-012 

1-012 

1013 

1-014 

1-014 

1-015 

1-016 

40 

1-007 

1-007 

1-007 

1-008 

1-008 

1-009 

1-009 

1-009 

1-009 

60 

1-003 

1-003 

1-004 

1-004 

1-004 

1-004 

1-004 

1-005 

1-005 

120 

1-001 

1-001 

1-001 

1-001 

1-001 

1-001 

1-001 

1-001 

1-001 

CD 

2 

l-oo’o 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

1-000 

*pr 

.60-9066 

65-1708 

69 0226 

73-6826 

76-9688 

69-8185 

74-4683 

78-5671 

83-5134 



P 

=4,r=16 



P = 

: 4, r=18 


V 

A/\ 

0-100 

s. 

0-050 

0025 

0-010 

0-005 

0-100 

0-050 

0-025 

0-010 

1 

1-731 

1-799 

1-864 

1-949 

2-012, 

1-773 

1-843 

1-911 

1-999 

2 

1-423 

1-450 

1-475 

1-507 

1-531 

1-457 

1-485 

1-511 

1-545 

3 

1-299 

1-314 

1-329 

1-347 

1-300 

1-327 

1-343 

1-359 

1-378 

4 

1-228 

1-239 

1-249 

1-201 

1-270 

1-252 

1-264 

1-274 

1-287 

5 

1-182 

MOO 

1-108 

1-207 

1-213 

1-203 

1-212 

1-220 

1-230 

6 

1-150 

1-157 

1-103 

1-109 

1-174 

1-169 

1-170 

1-182 

1-189 

7 

M 27 

1-132 

1130 

1-142 

1-140 

1-143 

1-149 

1-164 

A X 0(7 

1-160 

8 

1108 

1-113 

1-117 

1-121 

1-125 

1-123 

1-128 

1-132 

1-137 

9 

10 

1-094 

1-083 

1-008 

1-080 

1101 

1-089 

1-105 

1-092 

1-108 

1-094 

1-107 

1-095 

1-111 

1-098 

1-115 

1-101 

A A 17 4 

1-119 

1-105 

12 
• j 

1-005 

1-008 

1-070 

1-073 

1-074 

1-075 

1-078 

1-080 

1-083 

i * 

16 

14 

20 

1-053 

1-044 

1-037 

1-032 

1-065 

1-046 

1-038 

1-033 

1-060 

1047 

1-040 

1-034 

1-058 

1-049 

1-041 

1-035 

1-000 

1-050 

1-042 

1-030 

1-061 

1-051 

1-044 

1-037 

1-003 

1-053 

1-045 

1-039 

1-065 

1-054 

1-046 

1-040 

A V/UU 

1-068 

1-066 

1-048 

1-041 

21 

3 o 

40 

‘-0 

1:.| 

<30 

Y 2 - 

1-024 

1 017 
1-011 
1005 
1001 
1-000 

1025 

1-018 

1-011 

1-005 

1-002 

1-000 

1-020 
1-018 . 
1011 
1-006 
1-002 
1-000 

1-027 

1-019 

1-012 

1-006 

1-002 

1-000 

1-027 

1-019 

1-012 

1-006 

1-002 

1-000 

1-029 

1-020 

1-013 

1-006 

1-002 

1-000 

1-030 

1-021 

1-013 

1-007 

1-002 

1-000 

1-030 

1-022 

1-014 

1-007 

1-002 

1-000 

1-031 

1-022 

1*014 

1-007 

1-002 

1-000 

X pr' 

8-8597 

83-6753 88-0040 

93-2168 

96-8781 

87-7431 

92-8083 

97-3631 ! 

L 02-816 


0-005 


1-956 

1-489 

1-327 

1-242 

1-189 

1-154 

1-128 

1-109 

1-093 

1-081 

1-064 

1-051 

1-042 

1-036 

1-030 

1-023 

1-016 

1-010 

1-006 

1-001 

1-000 

86-9937 


0-005 


2-066 

1-670 

1-302 

1-207 

1-237 

1-196 

1-164 

1-141 

1-122 

1-108 

1-086 

1-060 

1-068 

1-040 

1-042 

1-032 

1-023 

1-014 

1-007 

1-002 

1-000 

106-648 




Table 7-3 (cent.) 


P=5,r=12 


0050 

1-643 

1-350 

1-240 

1-179 

1-141 

1-114 

1-095 

1-081 

1-070 

1-001 

1047 

1-038 

1-031 

1-026 

1-022 

1-017 

1-012 

1-007 

1-004 

1-001 

1-000 

70 - 0811 ) 


P=5,r=14 

0-050 n ., 


P=5,r=16 


1 - 7 G 8 

1-396 

1 - 2 G 5 

1-196 

1-153 

1-124 

1-103 

1-087 

1-075 

1-065 

1-051 

1-040 

1-033 

1-028 

1-024 

1-018 

1-012 

1-008 

1-004 

1-001 

1-000 

88-3794 


1-683 

1-383 

1-267 

1-203 

1-161 

1-132 

1-111 

1-095 

1-082 

1-072 

1-057 

1-045 

1-038 

1-032 

1-027 

1-021 

1-014 

1-009 

1-004 

1-001 

1-000 

90-5312 


1-813 

1-431 

1-204 

1-221 

1-174 

1-143 
M 19 
1-102 
1-088 ‘ 
1-077 

1-060 

1-048 

1-040 

1-034 

1-029 

1-022 

1-015 

1-010 

1-005 

1001 

1-000 

100-4250 


1-722 

1-416 

1-294 

1-226 

1-181 

1-150 

1-127 

1-109 

1-095 

1-083 

1-066 

1-053 

1-045 

1-038 

1-033 

1-025 

1-018 

1 - 011 . 

1-006 

1-002 

1-000 

101-8790 


1-855 

1-465 

1-323 

1-245 

1-196 

1-161 

1-136 

1-116 

1-101 

1-089 

1-070 

1-057 

1-047 

1-040 

1-034 

1-026 

1-019 

1-012 

1-006 

. 1-002 
1-000 

112-3290 


p = 6 , r=ll 


p=6,r=12 


p=6,r=13 


1-589 

1-321 

1-220 

1-164 

1-129 

1-105 

1-088 

1-074 

1064 

1-055 

1-044 

1-035 

1-029 

1-024 

1020 

1-015 

1-011 

1-007 

1-003 

1-001 

1-000 

85-9649 


1-704 

1-363 

1-243 

1-180 

1-140 

1-114 

1-094 

1-080 

1-069 

1-060 

1-047 

1-037 

1-030 

1-026 

1-022 

1-016 

1-011 

1-007 

1-003 

1-001 

1-000 

95-6257 


1-605 

1-335 

1-232 

1-175 

1-138 

1-113 

1-095 

1-081 

1-070 

1-061 

1-048 

1-038 

1-032 

1-027 

1-023 

1-017 

1-012 

1-007 

1-004 

1-001 

1-000 

92-8083 


1-722 

1-378 

1-255 

1-191 

1-150 

1-122 

1-102 

1-086 

1-075 

1-065 

1-051 

1-040 

1-033 

1-028 

1-024 

1-018 

1-013 

1-008 

1-004 

1-001 

1-000 

102-8160 


0-050 

-\ 

0-010 

1-621 

1-739 

1-349 

1-393 

1-244 

1-268 

1-185 

1-201 

1-148 

1-159 

1-121 

1-130 

1-102 

1-109 

1-088 

1-093 

1-075 

1-080 

1-066 

1-070 

. 1-052 

1-055 

1-042 

1-044 

1-035 

1037 

1-029 

1-031 

1-025 

1-026 


1-019 

1-013 

1-008 

1-004 

1-001 

1-000 

09-6169 


1-020 

1-014 

1-009 

1-004 

1-001 

1-000 

109-9680 
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Table 7.3 (cont.) 


p=4,r=20 


\ 

a. 0-100 

0-050 

0-025 

0*010 

0-005 

M 

\ 





1 

1-812 

1-884 

1-954 

2-045 

2-113 

2 

1-488 

1-518 

1-645 

1-580 

1-606 

3 

1-353 

1-371 

1-387 

1-408 

1-422 

4 

1-275 

1-288 

1-299 

1-313 

1-323 

5 

1-224 

1-233 

1-241 

1-252 

1-259 

6 

1-187 

1-194 

1-201 

1-208 

1-216 

7 

1-159 

1*105 

1*170 

1-177 

1-182 

8 

1*138 

1-143 

1-147 

1*153 

1-157 

II 

1-121 

1-125 

1-129 

1*133 

1-137 

10 

1-107 

1-110 

1-114 

1-118 

1*121 

12 

1-086 

1-088 

1-091 

1-094 

1-090 

14 

1-070 

1-072 

1-074 

1-077 

1-078 

16 

1-059 

1-061 

1-062 

1-064 

1-066 

18 

1-050 

1-052 

1-053 

1-055 

1-056 

20 

1*043 

1-045 

1-040 

1-047 

1-048 

24 

1-033 

1-034 

1-035 

1-030 

1-037 

30 

1*084 

1-024 

i -025 

1-020 

1-020 

40 

1-015 

1-010 

1*010 

1-016 

1-017 

60 

1-008 

1-008 

1-008 

1-008 

1-008 

120 

1-002 

1-002 

1-002 

1-002 

1-002 

00 

o 

1-000 

1-000 

1*000 

1-000 

1-000 

*pr 

96-5782 

101-879 

100-029 

112-320 

110-321 


P=4,r=22 

/-*-, 


0-100 

0-050 

0-025 

0-010 

0-005 

1-848 

1-922 

1-994 

2-088 

2-168 

1-518 

1-649 

1-577 

1-614 

1-641 

1-379 

1-397 

1-414 

1-436 

1-461 

1-298 

1-310 

1-322 

1-337 

1-347 

1-243 

1-253 

1-262 

1-273 

1-281 

1-204 

1-212 

1-219 

1-228 

1-234 

1-176 

1-181 

1-187 

1-104 

1-109 

1-152 

1*157 

1-162 

1-168 

1*172 

1*134 

1-138 

1-142 

1-147 

1-151 

1-119 

1 123 

1-126 

1-130 

1-134 

1-095 

1-098 

1-101 

1-104 

1-107 

1-079 

1-081 

1-083 

1-086 

1-088 

1-066 

1-068 

1 070 

1-072 

1-074 

1-057 

1-068 

1-060 

1-062 

1-063 

1-040 

1-061 

1-052 

1-053 

1-055 

1-038 

1-030 

1-040 

1-041 

1-042 

1-027 

1-028 

1-020 

1-030 

1-030 

1-017 

1-018 

1-018 

1-019 

1*019 

1-009 

1-009 

1-009 

1-010 

1-010 

1-003 

1*003 

1-003 

1-003 

1-003 

1-000 

1-000 

1-000 

1-000 

1-000 

105-372 

110-808 

115-841 

121-767 

125-013 


Source: K.C.S. Pillai and A.K. 
pp.109-118. 


Gupta (1969). Biometrika 
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Multivariate general 

EXERCISE 7 

The following data relating to the first lactation yield 

(X 1 ,X 2 ) in 10 lb units of ( X]L ) and of daughters (x ) of 5 

sires are taken from the records of Kankrej herd maintained at ., 
the Institute of Agriculture, Anand,India ( Source : National index 

of field experiments M.P.6, Indian Agricultural Research Institute, 
New Delhi, India). 


Sire 1 

Sire 2 

Sire 3 

Sire 4 

Sire 5 

216,258 

153,112 

274,313 

340,340 

152,180 

216,178 

276,269 

375,374 

251,347 

375,236 

269,243 

260,247 

312,320 

169,193 

198,120 

236,178 

299,198 

229,532 

208,235 

242,140 

156,111 

298,357 

171,203 

182,252 

269,227 

280,233 

266,327 

197,269 

221,280 

221,140 

190, 60 

281,200 

226,153 

193,194 

532,277 

340,262 

186,190 

260,228 

312,169 

187,179 

216,191 

188,261 

166,184 

299,242 


277,138 

168,235 

252,383 

188,253 


121,197 

251,165 




261,234 





190,123 






Analyse the data to test the hypothesis that the population means 
pertaining to different sires are equal. 


2. The following bivariate data (x^jX^), x^ - total green 
weight in lbs. of jute plants per plot, x 2 - total dry fibre 
weight of jute per plot were collected from a fertilizer trial, 
\rith 12 different combinations of (a) 3 levels: - 0 lb per 

jcre, P = 40 lb per acre, P 2 = 80 lb per acre of P 2 0 5 as 
superphosphate, (b) 2 levels: » 0 = « lb per acre, = 20 lb 

jer acre of nitrogen as ammonium sulphate, (c) 2 levels: 

: = 0 lb per acre = .25 lb per acre of K 2 0 as murate of 

>otash, on a particular variety of jute crop. The 
experiment was conducted in Sagar-Bishnupur, West Bengal, Indxa 
,nd the layout was a randomized block design four blocks 

>f 18 ' xl9 ' (net) each. 





e»W t ! 


[#J 


HS; 

nB 

PI 


88.3,5.66 
93.4,6.31 


m 


7.42 

106 

8.34 

140 

8.24 

109 

7.60 

98. 

7.18 

104 


7.17 

101.8,6.70 

10.0 

111.6,7.57 

7.40 

112.4,7.36 

.50 

92.6,6.24 

7.00 

115.8,7.95 

8.90 

115.0,7.85 

7.40 

118.3,7.90 

8.24 

104.9,6.68 

7.06 

94.0,6.35 

7.13 

104.9,7.10 

8.91 

106.1,7.06 

7.40 

121.0,8.50 


Analyze the data to examine if the different fertilizers have 
different effects on this particular variety of Jute. 
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CHAPTER 8. NONPARAMETR 


IC ANAL ^.S OF Wriance 


Nonparametric procedures are e 
development of statistical method ■ C ° ncerned with the 
assumption regarding the fun Wlthout makin 9 any explicit 

distributions of the ° 10nal form of the probability 

concerned with cases where^he^rT 0 " 8 ' U£>til °° W " e were 

univariate or multivariate -ormal iiTth ^T “ knOW " ^ ** 

with cases where W o do n , S chapter we sha11 deal 

functional f e ° ” lake £ “ y explicit assumption about the 

functional form of the univariate sample observations. The 

multivariate analogue of these results are more complicated and 

we refer to Puri and Sen (1971) for multivariate procedures 

analogues to the univariate ones discussed in this chapter. 

The relative performance of various nonparametric methods 
are usually studied for large samples and the study of their smal] 
sample properties often involves quite cumbersome computational 
scheme. The development of the asymptotic theory of nonparametric 
statistical inference is somewhat spotty and piecemeal as comparec 
to its parametric counterparts. We refer to Puri and Sen (19 71) 
and Lehmann (19 75) for further readings and complete references 
on this topic. In our development of univariate nonparametric 
analysis of variance we mainly state various results without proo 
and we refer to above references for their proof. Before we star 
discussing the one-way and higher-way classifications we deal 
briefly some relevant results on the estimation in one-sample 
and two-sample problems and testing of hypotheses of two-sample 

location parameters. 

8.1. ONE -SAMPLE LOCATION PARAMETER (ESTIMATION) 

are distributed according to a common 
Suppose that Q) whose cen tre of symmetry 0 is to be 

symmetric distribution ^ absolutely continuous, when I 

estimated. Let us assume tha known that 

, finite variance it i- w 

is normal with 
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0 


1 n 
± 2 


( 8 . 1 ) 


X i 


wai-iance unbiased estimator of 0. This estimator 
■i„ the minimum variance . 

... 4-0 crross errors and tends to be inefficient 

is very sensitive to gross errui. . . _ . _ 

th heavy tails. An example of this fact as provaded by the 
Cauchy distribution for which the distributaon of 8 for any „ 


{Jd.UCXAV wl-LO Ui - 

is the same as X x so that the average is not better than the 


is UIIC ocuuc - . . . . 

single observation. In view of this fact that the dastrabutaon 

of x,-s,...A- ! is symmetric about the origan, the best 
estimator of 6 is that value of 8 for which the values X^e, 

.. . r X -0 give the best balance relative to the origin- 
Consider a test statistic MX^,...,.^) for testing the hypothesis 
that the common distribution of X-^-0,. . .,X n ~0 is symmetric 
about the origin, Obviously the variables X^-0,...,X n ~@ will be 
best balanced about the origin when the test statistic h 
evaluated for X^ - 0, ..., X n - 0 takes on the value giving the 
strongest support to the hypothesis mentioned above. This will 
typically be the case when h takes on its central value. One 
may thus expect the resulting estimator to share some of the 
properties of the test from which it is derived. A few commonly 
used test statistics 


are as follows. 


(i) Sign-test statistics ; Evaluated for the variables X-^-0, 
...fX^-e it is the number among the X.^-0's which are positive. 
Obviously, it gives the greatest support to the hypothesis, that 
the variables Xj-0,... ,X n -0 are symmetric about 0, if half of 
the variables X A -0 are positive and half are negative and it 

happens if 0 is the median of the X ± . So the sign test leads to 
the estimate 


where med(X.) denotes the median of the x 
x i* 

Ui) Wi lc °xon Signed rank test statistic : Evaluated for the 
variables it is the number of pairs (i,j) with i < j for which 












, - e+x. - e is, ^ 
i J ^° s itiv e 

tl alf of the sums y and it 

a. . — 04«y 

negative. Thus th* 1 J 8 ' £ °- 


n ^MET Rlc 


ANALYs 


it- *. 

■ t : e+x r«. ■«» *. 
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A e = med H(x +x ) J r 1 < i < J < „ he med ian Q f the 

I i 13 1 r - ■ ~ n ' that iSi 

I ( see Ho ^ges and L o hm 

I L -hmann (1963) (8.3) 

f Examp le « i ' faen (1963) f nr * 

1 5.4, 7 ^xi\ c °»^ the sample o ;;° r 

I estimate the centre e’oi ^ 9 ^^utioiT iJlT ? * 4 ' 
f symmetric about e v the di stribution which . S esired to 
1 - fi n rr, V * Fr °m the sion * . ^assumed to be 

I' * ° get 9 first observe tv * 3 StatiStic 0= *15.8 -h 6 . 2 ) 

■ with i < j and the median of the “ there are 21 sums Sj (x.« .) 

I smallest. On ordering the oh se s ™s will be the nth 1 J 

las 5.1, 5.4, 5.8, 6 . 2 , 74 S ® rVatlon = we rearrange the sample 
I sums is given by Jj( 5 [ 4+7 ' 6) 1 ' . S ° th ® Uth smEaiest of the 

I 9-6= med(i^(x.+x i_e) 

J 1 j' 

I = med(J s (x i - 0 )+J s ( X _ 0 ) 

land the distribution of X .-8 and X -9 is independent of 9 .S -9 is 

^independent of 0 . Similar results hold forl^-e and e-e. 

\^Furthermore, from problem 1 if the distribution of X ... x is 

| symmetric about 0 the same is true about the distribution of 5 

10 and 0. This result shows that each of the three estimators 

I are unbiased for 0 and each is as likely to overestimate 0 as to 

I underestimate it : It is known that (see for example Lehmann (1975) 

[ 0 and 3 are considerably more efficient than 0 when p is normal 

with then 6 being slightly more efficient than 9 . But for 

distributions with tails heavier than the normal e may be 

considerably more efficient than e and in general it appears to 

be a reasonable one. On the other hand when the observations can 
be a reasonab ^ fee con , puted easily and is usually 

easily be ordere Purth ermore if F has finite 

used when F is not sy estimator of 9 and if F has a 

expectation then 8 1S ~ a . C ° nS cons i S tent estimator of 9. if 9 be 

unique median 0 then 0 


Cer itral 


valu e if 
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such that 

p{!5(X+Y) < 9}= PWX+Y) > e >= *5 

where X, Y ere independently distributed ar F and if 6 is the oni y 

point with this property then e is consistent for 9. 

8.2. TWO-SAMPLE LOCATION PARAMETERS (ESTIMATION AND TESTING) 

■ , . v v.v Y, N=m+n, are independent samples 

Suppose that X—' m' 1'***' n . , 

from the absolutely continuous distribution functions F(x) and 

G(x) = F(x-e). This is called an additive model (also shift model) 
which assumes that the treatment adds a constant amount 0 to the 
control response of the subject independent of the value of that 
response. Since the variables •••' x m ;Y i“J''* *' Y n“ 0 have the 
same distribution we may then estimate 0 by 0 - the amount by 
which the Y values have to be shifted to give the Y values the best 
possible agreement with the X—values. To achieve this let Us 
consider a test•statistic 

*’ r • 

h(X 1 _,X m ?Y 1( ...,Y n ) = h(X,Y) (say) 

for testing H q : 0=0 against alternatives 0^0. Suppose that under 

H the test statistic h(X,Y) is symmetrically distributed about 
o 

some value y and let the ( two sided test rejects H q whenever 

h (X, Y) 21 V + c or h(X, Y) <. y - c. 

Then the v^lue of h(X,Y) which is most supportive of H q is 
the value closest to the centre y of the distribution. The two 

A. , ■ A 

sets X.,, ... / X and -0 will therefore be considered 

l m l 

as being in the best agreement if they assign to the statistic 

A A 

h(x^/ .. * / X^y Y^—9/ ••• / Y^— 0) this closest value•. A commonly used 
test statistic h(X,Y) is the Wilcoxon statistic y . By defini¬ 

tion W x y _ e is the number of pairs (i,j) for which Y.-Q > 
or Yj - X_^ > 0. So W x Y _g will take on its central value if 
half of the differences Y^-X^^ are greater than 6 and half are 
less than 0. Thus the value q of @ for which this is achieved 
is the median of the mn differences Y j -X i (see Hodges and 

Lehmann (1963) and Sen (1963) for details). Let d.^., _' d ( mn ) 

denote the ordered set of mn differences Y^-x^ 
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•t x «Vu* T' *■ «* 

vv.e > w*«»—» xr 
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d a tie (k+1) i ^ X i'VJ is the pair with Y x - 

(k+1) a le occurs between X. and Y -9 TakinfI tv, j'* 1 

of a tie to w tv., 1 J ‘ Tklg the contribution 

w _v 4 .v *' Y “ 0 14 £ollows that the associated value o£ 

X,Y-8 k i and we take 0 = d (k+1) =med(Y.-X.). 

\ Jt oan be verified that in the shift model under the assump- 

^tion that F is continuous the distribution 0-0=med(Y -x )-0 is 

' a j i 

independent of 0 . Also 0 is symmetrically distributed about 0 
if either the distribution F is symmetric about 0 or if m=n and 

A A 

this implies that E(6)=0. Furthermore 6 is consistent for esti¬ 
mating 0 when both m, n ■* ». 

We shall now consider an alternative expression for V» x y in 
terms of a statistic R which is commonly known as the Mann-Whitney 

statistic. 
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assume without any loss^of generally ^ 


' v - analysis of variance 

Id prove (8.5) l et fche rank of V, i* Kj there are r^ 

Y < Y < ■■■ < V „ of which is smaller than y 

Observations among the X. 

Similarly there are Rj- 

smaller than Y 2 and so on. 


, 2 ‘observations among the X.. each of W hi otl 


Hence 


is 


W. 


X,Y 


> <*1- 1+R 2' 1+ 


+ V" 


n(n+1) 
= R - 2 


Let us now consider the problem of testing the hypothesis H^o. 
In deciding an appropriate test it is not enough to specify H(j 
alone. We should also specify the alternatives against which it 

is tested. The model 

( 8 . 6 ) 


G(x) = F(x-G) 

for all x expresses the fact that the distribution G is obtained 

by shifting F.by an amount 6 >0. In other words the model (8.6) 

supposes that the treatment adds a constant positive amount 6 to 

the' control response. In the shift model it is then natural to 

consider the alternative hypothesis H^: 9 > 0. The Wilcoxon test 

of H against H, rejects H whenever 
o 1 o 


W > C 
X, Y — 


alternatives H, 


unbiased for testing against H^. 


where the constant C is determined so that the test has size a J 
It is known that the power function of the test against the 

1^: 9 > 0 is a nondecreasing function of 0 and it is 

It is worth noting that for 
testing H q we have tacitly excluded the case that the treatment 
effect is negative, i.e. 0 < 0. it follows that the power of the 
test (8.7) is less than the level if significance a for 0 < 0. 
Thus if the hypothesis H q is replaced by H' : 9 < 0 that the 

treatment is no better than the control the test (8.7) of size a 
is also a level a test of H 1 . 
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The overall comparison of the Wilcoxon test over its normal 
test tends t”!!* 1 * ^ Student ' ,t test suggests that the Wilcoxon 
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efficient then the t-test (for detail C ° nside tabl y more 

To evaluate the constant c ' "* Leh " ann(1975 »»■ 
for example, by Buckle Kraft extensi ve tables are available, 

and Wilcoxon, Kattl and Wii. ^ 

In actual practice we very often meet with a difficulty of 
tie ranks which arises when two more observations are equal. In 
such a situation it Is customary to assign the equal observations 
their average rank or which is commonly called the mid rank. 

Example 8.2 . Let m = 2, n = 3. The observations are 2.2, 2, 
2.6, 3.1, 3.5. The smallest observation 2.2 has rank 1, the 

largest observation 3.5 has rank 5, the second largest observation 
3 1 has rank 4. We do not know which of the two observations 2.6 
has rank 2 and which has rank 3. We assign to each of the observ 

tions 2.6 the mid rank is(2+3) =2.5. 

, H .0 = 0 it can be verified that E(W x , y )=^nm, 

Under H . 0 under H E(R)=Hnln+m+, 

# , m xiW12 Thus from (8.5) under h q 

Var (W y ) - nm(n ’ large (see for example Lehman 
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ANALYSIS OF VARIANCE 

... cable) is extremely useful. Under H q it can 
this case “ PP . 1 n(n+m+1) . » other words the expect^ 

^ ttrnliLe/statistic resins unaffected tor tied 
observations. But the variance of R* depends on the number of 
observations tied at different values. Assume that n + m observe. 

tion X..X ; Y,.V n thke on only P distrnot values. Let 

k of the n + m valGes be equal to the smallest value. k 2 of the 
n+m values be equal to the next smallest value, .. . , * p to the 
largest. Thep 

p 3 

ntn E (k.-k.) 


,_. L1 nm (n+m+1) 1_ / D 

Var (R*) = - rj- --“ 77 * l 8 -'a 

12(n+m)(n+m-1' 

In general the second term in the right hand side of (8.7a) 

(correction term) is quite small although exceptions are not 

completely ruled out* If max k./(n+m) is -bounded away from 1 

i 

as n + m -*■ 00 then under H q [R*-E (R*) ]/Var (R*) is also normally 
distributed with mean 0 and variance 1 as m -*■ n ->■ °°. 

Typically the normal approximation for R* is not as close as 
that of R for the same group sizes. The ties tend to decrease 
the number of values taken on by. R and thereby decrease the 
lumpiness of its distribution. Lehmann (1961) has done a number 
of comparisons of the exact and the approximate distribution of 
R with and without continuity correction and has observed that 
these comparisons are on the whole favourable to the continuity 
correction. 

For testing H q : 6 = 0 against two sided alternatives H : 6^0 
the Wilcoxon test based on R rejects H whenever 

R - C l “ R — C 2 (C 1 <C 2> 0 (8. 

where c y C 2 are constants such that P H (R < c^) + p (r > c )=a 

When H o is true. In cases where two treatments play°s y mmetric 
r ° * Xt 1S natural to choose C 1 , c 2 such that under H q 

P H ( R 1 C 1 ) = p„ (R > c J -.a 


/ a 
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For the fact that the distribut¬ 
ion (n+m+1) we can have from 1011 ° f R iS symmetric about 

test based on r reiect* ^ 8 * 8) and (8 ‘ 8a) tba t the Wilcoxon 

J ts H q whenever 

IR- %n (n+m+1) | > c or . 

equivalently |W xy - ^ nm | > C (8.8b) 

where ^tj { | W -i$mn I > = — 

o a, y * a * the presence of ties we replace 

(8.8). Since the distribution of R* is not symmetric 

we cannot have (8 8b1 _ _ 

' terms of R . But instead we can use 

(8.8a) to get C anH r n 

1 2" For l ar 9 e m,n we can use the normal 

approximation under H q , as discussed above to determine c. and C, 
in both cases. 1 z 

8.3. ONE-WAY CLASSIFICATION (LOCATION MODEL) 

Consider p(^ 2) independent sets of random variables X.., i = 

. j_ J 

1, . - • / ru , i - 1/ . . ., p. Let X^, . . ., X_^ n be a random sample of 

size n. from the continuous distributions F.(x), i = 1,_, p and 

let 


F^(x) = F(x-6J for all x. 

We first consider the problem of estimating the differences 

A.. = 9.-0 . From (8.4) 

U 1 J 

A 

A■• = med (X. -X ) 

“ij la j 3 


(8.9) 


( 8 . 10 ) 


the median of the n.n. differences X -X , a = l,...,n.,3 = 

1 j ia J 3 1 

1 n is a reasonable estimator of A. .. However these 

j ij 

estimators are not compatible in the following sense. Suppose 
that we want to estimate Q 2 ~ Q 1’ when we estimate it directly 
we get A 21 . Since 0 2 _0 i = e 3'' e i +0 2 _0 3 we can take estimator of 
A 21 as the sum of estimators A 31 #-A 23 of ^31 and A 23 respectively. 
Obviously A 2r A 31 + A 23 may differ. To avoid this awkwardness 

Lehmann (1963) proposed to replace (8.10) by 

A. . = A - A . (8 * n 

ij i. • J 

where A = SA ,/p and showed that these adjusted estimators of 

k. j kj 
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first express it in 


terms of the elementary 
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uniquely defined ( though the cl.^ 

efficiency of 1 relative to the normal theory classical 

estimator Z Z d i ■ - (X i-” X j - } * S the Sam8 &S that ° f A ij relative | 
x # jn the case of unequal sample sizes it is preferable 

to use I 2d. .(A -A. ). The large sample solutions of the 

problem of finding simultaneous confidence intervals for all 


contrasts are discussed by Sen (1966) , Puri and Sen (1971), 
Marascuilo (1971) . 


Tests of hypotheses . Let N = E n.. Let us rank N 

i 


observations X , i = l,...,p, j = 1,_,n. is ascending order om 

i J i 


magnitude and let R denote the rank of X . We are 

ij 

interested here in testing the null hypothesis H q : 0 1 =. . .=0 p . 
in deciding on an appropriate test we should also need to 


specify the alternatives against which H is being tested, in 
the analysis of variance model the q ± represent the responses of 
P different treatments under consideration and we assume that 


treatments mainly affect the response levels. Under this 
model there is a natural order among the treatments in the sense 

lowest a *j ends t0 9iVe the lowest response, another gives the nex 
treatment in^l *^ ndlCatl °" of the position of a particular 
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here R . n . R 1 _, and the constant c depends on the size a of 

e test. For p = 2 the statistic Q reduces to the square of the 
ilcoxon statistic W Under H 
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To deal with cases of tied observations let us suppose that 

the N observations X^. take on n distinct values. Let observa 

tions are equal to the smallest value, k^ observations are equal 

to the second smallest,..., and k^ observations are equal to the 

largest. Let R*. be the midrank of X. . and R*. = £ r* Tha 

ij ij i . ij Iie 

Kruskal and Wallis test of for tied observations is given by- 

rejecting H q whenever 


Q* = 


12 

N(N+1) 


- 3(N+1) 


1 - Z (k 3 k )/<N 3 -N) 
i 1 1 


(8.15) 


where the constant C depends on the size a of the test. Obviously 
the computation of the distribution of' Q* under H is very 
laborious and tables are impracticable. However except for very 
small values of N the distribution of Q* under H can be 
approximated by the central chi-square distribution with p-1 
degrees of freedom. 

If the hypothesis =...=6 p is rejected by the Kruskal 

and Wallis test there is a necessity of dividing the treatments 
into groups, such that the treatments having identical responses 
belong to the same group and those having different responses 

belong to different groups; and to find the best treatment. We 
shall now discuss these below. 

C omparison of new treatments with a control . Suppose that 
the treatments under consideration consists of a standard treat¬ 
ment (control) and (p-1) new treatments which are possible 
improvements over the standard one. To. find the group of new 
treatments which are better than the control we can apply the 
Wilcoxon test (Section 8.2) to compare each new treatment with 
t e control. Assume without any loss of generality that the ith 

new treatment has n., i = 2.. observations and the control 

one has ^ observations. Thus to apply the Wilcoxon test we 
ombrne the n. observations for the ith new one with n, observe¬ 
rs for the control and rank them. Let R,, i = be the 








nonparametric analys 
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sum of the ranks of n. observation. for the _ ' 

Jt may be noted that the R. ig d lth new treatment. 

in < 8 - 13> which are obtained from ** 33 de£lnea 

* S n X + + n p observations. We judge the ith n \ 

to be superior to the control of treatment 

Sj - C i' 1 = 2 '-»P 

1 x la i c 



1 (8.16) 

rfhere the constant c denendQ +. 1 ^ i • i 

i epenas on the level of significance a. 

of the test and are determined from the table of the Wilcoxon* 

statistic W x y . in particular if n 2 = ... = n and if the new 

treatments play a sufficiently symmetric role, one would then 

be inclined to choose a 0 =...=a = a (say), the C. will then be 

<c P i 

equal, that is, C =_= c = c (say), and (8.16) reduces to 

^ P 

R. > C (8.17) 

1 — 

where the constant C is determined such that P{R^ c) = a under 
the hypothesis that the ith new treatment is not superior tc the 
control. In terms of the shift model (8.9) our aim is to select 
those new treatments for which 0. - e x > 0. The test procedure 
(8.17) achieves this aim by performing a set of tests of hypo¬ 
theses 0 . - 6 - <0 against the alternatives 0 . - 0 1 > 0 such 

that the maximum probability of falsely rejecting at leastJ»e^ 

of the hypotheses 6 , i - ^ i °' 1 " 2 . . ° CCUrS ” ^ 1 p 

. 1 , , ■--j — of the treatment. In comparing p treatments 

- face the pr oblem of selecting the best of a 

it is natural ^ slmples t procedure is to select the 

number ° f ' [ a "g est ‘average rank as obtained from the combined 

one with t +n observations. In actual practice 

ranking of N n l * ” P select first a subgroup that is 

sometimes it is advantageous treatment. Having 

reasonably certain to contain ^ wouia then carry 

narrowed down the ch ° 1Ce select the best in this subgroup, 

out more extensive experiment (1 „ 0) suggested 

n acting the subgroup Gupta treatment in the 

For selecting include t he ith 

the following 
subgroup if 
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(1) 

( 2 ) 

(3) 


R. > max R. 

i* “ i 1 * 

max R. 

l. 


R. > 

l. — 


C 


R. > C 
1 . — 


where C is a constant such that 


( 8 . 18 ) 

(8.19) 

( 8 . 20 ) 


P{correct selection} j> y (specified). (8.21) 

For the shift model (8.9) with additional assumption that 
n l ~*** =n p=n (say) and 0^ < 0^ <...< 0 Gupta and McDonald 
have shown that for any fixed value of 0 the probability of 
correct selection with the rule (8.20) decreases as 0^, ...,0 
increases towards 0 p and hence the probability of correct 
selection tends to its minimum value as @^...,0 ■ tends to 0 

The value of C with the rule (8.20) is given by P{R >c}=y ^ 

whenever 8 1 =...=0 p where nR^ is the sum of ranks.of*X r ... /X 
in the joint ranking of the X_ and has the distribution of the” 
Wilcoxon statistic with group sizes m = n(p-l),n. The determina¬ 
tion of C for the rules (8.18) and (8.19) is not very straight¬ 
forward and we refer to Gupta and McDonald (1970) for these 
determinations.. Lehmann (1963a), Rizvi and Woodworth (1970) 
and Ghosh (1973) have also discussed this problem from a 
different view point. 


8.4. EXPERIMENTS WITH TWO OR MORE FACTORS 

In such experiments two or more factors each at two or more 
levels are considered simultaneously. Some of the factors are 
of real interest to the experimenter while the others are 
introduced to reduce the variability of the experimental 
materials and thereby increase the accuracy of the estimators 
and the statistical test procedures. For the simplicity of 
presentation we shall consider only the complete block designs 
in details. However we shall give proper references to 
incomplete blocks and nonorthogonal design. m deriving 
suitable nonparametric test statistics we shall consider the 
method of "n-ranking" and the method of "aligning" only. 
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T d iL nOW r Sl3er the m ° ael 0f »'t«-«, classification 

where m different treatments ara ■ 

re a PPlxed to each of n different 

bl ° C S 3 each block contains exactly m treatments and each 

treatment is replicated exactly n times. Let X. ., j - 1.m, 

i ' ' be tbe ran dom variable associated with the yield of . 

the jth treatment in the ith block. Let X. . be independent and 
let F ±j ( ? c)= P {x ij < x}. 1J 

In the parametric case, as we have seen, F, .(x)=F(x-y-0.-0.) 

ij i. J 

with Z 8^-0, Z 0. = 0 where the 8. is the ith block effect, 
x j J i 

the 0j is the jth treatment effect and y is the grand mean and 

also we require the assumption that F(x) is the distribution 

function of a normal random variable with mean 0 and variance 
2 

a . The null hypothesis in this case is 0.. =...=0 =0 which in 

1 m 

this additive model implies that 

F ij ( x> = -•• = F. m (x) = FU-p-e.) . 

The method of n-ranking assumes neither the additivity nor 
that the F. . differ only in location. The normality of the F„, 
ttie additivity of the 8^ and the homocadesticity of the errors 
are not required. We shall assume throughout that the F_ are 
continuous. Under this assumption we formulate the null 
hypothesis H q of no treatment effects as 

H : F. , (x) =...= F (x) = F (x) (say) (8.22 

O ll - LUl 

± = 1 _ nm In this formulation the treatment j is stochasti¬ 

cally better or worse than the treatment j ' if 

p{X > X ijf } > h or < h 

for^H i with the strict inequality holding for atleast one i. 

u-ornative to H is H-: atleast one of the 
The hypotheses ^ worse than the rest . B 

treatments is ^ ^ fche specif ica tions of H q and ^ become 

the nonpara ^ the additivity of the treatment 

quite simpler it we 

effects in the sense that ^ 

„ /v ) i = l,...,m,i-l -,n 

. (x) = ( x " 6 j ' J 

In such cases H becomes 
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(6.24) 

6 1 ••• =S "' 

H , e . * 0 at least for one j. Let R ±J be the rank of X ±j 

1 J since F are continuous by assumption 

among X.ij , ...... _ 

ties among observations may be ignored in probability. Let 
R = (R _ _ R ). under H ‘ the distribution of R ± over m! 

possibl^realizations is uniform whatever may be the form of F ± 

provided that they are continuous. 

Various tests based on the R^j have been proposed. But 

the one which is often used is proposed by Friedman (1937). We 

refer to Puri and Sen (1971) and Lehmann (1975)/ Kendall (1970) 

for other tests and relevant results. The Friedman tests based 

on R.. rejects H whenever 
ij o 


nm(m+lj 


(R - Siftii ) 2 


12 

nm(m+l) 


m „ 

E R, - 3n (m+1) >_ C 
1 2 


(8.25) 


AX 

where R. = E R. the sum of the ranks for the treatment j and 
J i=l 1J 

the constant C depends on the level of significance a of the test. 

Tables of the distribution of Q under H q is given by Friedman(1937] 

Owen (1962), Kraft and Van Eeden (1968) and Kendall .(1970) . For 

n and m large Q is distributed as X^_ 1 (chi-square with m-1 

degrees of freedom) whenever H holds. 

o 

For tied observations in the same block we assign the tied 
observations the midranks (the average of the ranks in the same 
block with which the observations are tied) . Ties among 
observations in different blocks cause no difficulty since the 
observations in the same block are being ranked to obtain the 

test statistic Q. Suppose the m observations in the ith block 
take t i distinct values with d. ., i = i, ... # n# j _ x t 

Of the observations being equal'to the jth'largest value'among 
the t i distinct values in the ith block. Let R*. be the 
midrank of the observation x ij and let R? = ? lj R » . The 

hypothesis H witv, . . J i=l 

° h ® tied observa tions is rejected whenever 
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ls 0F variance 


1 J .3 

t.*” ^helistrib ° depenas « thA 1J aiJ>/ftm(m2 ' ln - C <••*« 

are large. Typ^^* Q *«*.r « « the 

Q is quite reasonahi E ° hl ' ?qUare °approx V 1 > ' hen n ana » 
m = 2 reduces to th e , Whenever mn > 30 . _ lr “ atlcm E »r both q and 

is more effective fo/thi ^ 31,3 * he *" 

Friedman test is reU t lv ^ £•*« it has been 

Xt IS mainly due to the fj t that *“ l ™ valu « ° £ - » a 

provides comparisons oniv th. SeParate ranl<lng ln each block 
of the variations between bio ” sp “ ses ' withl " aa <* block. Becauss 
t««>» h lah - ”>— 

insistently low responses, a direct 
comparison of responses in different blocks is not, in general, 
very meaningful. This difficulty can be eliminated by subtract¬ 
ing from the observations in each block some estimate of the 
location in each block such as the average of the observations 
of the block or the median of these observations. This technique 
is commonly called the method of aligning. Contrary to the 
method, of n-ranking the method of aligning assumes the additivity 
of the block effects and the homoscedasticity of the errors of 
‘different blocks. In other words to use the technique of align¬ 
ing we assume that ^ 2 

, A >o i=l,...,n, 

v = u +a. + 6 . << 1 ' 


Y = U +01 . 

x ij • 1 


, a are - 

• -i-vxd orand mean* cx- / n 

1 n (mav be rU™ or fixed • .j.;--. 


the block effects 


where y 

-»»- - - - 
rri:'—--• »“—“■* 

^ ■ u i‘ r:;r» 


= 0 (may t e 


) 1 # i — x • 

£. = (£ t i . . ' fflrs having conti^ 

1 _ random veC _ -nrh that 


—i -*■ vectors t eac n — 

are independent random ^ . .„ such th as . sped* 

. F (Xw •••'*!» ' 0 bvi° usly ■ aentically distribu 

function Fj/ i arauments- , nt and i den 

i. ..Ch Of its marg .^pendent 


case i 
error 


the 


:lassi ( 


case 


comp° n 


ents ^ij 
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The technique 


of aligning consists of eliminating the block 


effects or by taking 

Y ij = X ij " X i (8 - 2 .8) 

where X .is a suitable estimator of the location' of the ith block. 
We shall consider here the case X. = , the average of the ith 

block. Let us now arrange Y ±J . = X ± j - X_ L , 1 - j = l,... f] 

in order of magnitudes and let R^j be the rank (or mid-rank in the 

case of tied Y. .). The . are called the aligned rank of the Y, 

Under H : t) =...-6 =0, that is, under the hypothesis of no treat- 
o \1 m 

ment effect regardless of which position in a block were assigned 
to which treatment, the mi possible arrangements in any given 
block will simply result in mi permutation of the aligned ranks 
in this block. The total number of equally likely permutations 
within the n sets of aligned ranks in all blocks is (m!) n . 

A 

Therefore the distribution of R. . under H is given by 

ij o J 

A A 

P{R ij = r ij ' 1 = 1 . . J = X . m > = (ml) ~ n . 


Let R. = 1 R.. 

J IJ 


— R . 
n J 


Under H £ R. = ^m(mn+l). 
o j . 


A test 


based on r of H q which is analogous to the Friedman Q test 


rejects H whenever 
o 


1 1(R.,-R 

i j 1J 


in /\ 

2 ~ t 2 (R,-^n(n m+l)]‘ 

i. } J= 1- J 


= m-1 _ r m "2 2 n 

' z Z(r...r )2 C * R j-' 4mn (m n+1 > 2 ] 

i j u i- J 

2 c 

where R. = — y R , 

m u ij the constant c depends on the level o 

significance a of the test, since * z J. 2 

all ranks or mid _ ranks 1 J lJ ^ SUm ° f SqUa 

° f . m i» »• bioc k is ; ixea ; on a ly c °::: ant : simiiariy ' - the 

m being the sum of the ranks of iv, ^ bein9 random) ' 

constants. Hence ith block ate also 
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(8 -2 8) 

^b lock _ 

h6 ith 
= i... 

_ -i *»m 

k in t he 
t the Y 

ass igned 

iven 

ranks 

ations 


test 


2 test 


re 1 °^ 

squares 

. t he set 
the 

>m; / 


1 1 * (S iJ" R i- ) * l ] R U 2 r 5 ?<» s t )* 

i J J i • L * ' • 

. _ a constant. 
i» 

j For small values of m, n the distribution of h under H 

f iS T* “71 However for large vaiues of m, n 5 is ° 

j distributed as . yor computational purposes, the test 

based on Q is more laborious than the test based on Q. To 

see if this extra effort i <? uarranho^ •, 

1S war ranted it is necessary to know 

by how much it will increase the efficiency of this procedure. 

Under the assumption of additivity and under the assumption of 

normality of F. , (x) the optimum test of H :0. = ...=e =0 rejects 
, J o 1 m 

H whenever 
o 


S n (X -X ) 2 /(m-l) 

] * J * * 

2 2 (X..-X .) 2 /(m-l)(n-1) 

i j *J 


'(8.30) 


— ml 

wnere the constant C depends on the size a of the test and 

under H the statistic U has central F-distribution with (m-1), 

( n _l) (m-1) degrees of freedom. We are interested in the 

Friedman test based on Q and Q mainly because we are not sure 

i about the normality assumption of the distribution function F. 

V Since the distribution of Q (or Q> under H q is independent of 

1 the form of the distribution function F, the level of significance 

a of the Friedman test based on Q (or Q) is thus independent of 

the form of the distribution function F. Obviously this is not 

ttue for the test based on 0 where the level of significance does 

* , „ n th e form of distribution function F. However if 

depend on o£ y u „ der t ends to x^x irrespective 

" r If F provided the variance of the distribution F is 

of the form of F p rais ed above in connection with 

£inlte . T o Illltiiy compare the Friedman 

the test based on Q. based on U in terms of 

test with the normal ^ de£lne a as the limiting value 

their Pitman effi ciency of b i oc ks required by t e 

as n - - of the ratio of the n™ ^ the same power against 

two tests of the same size a 

r i 
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alternatives. 


and Noether (1959) showed 


the same 
that the 
F>-test is 0 
0.769 for m 
for m 


^ofthr^iedman test relative to ths 
Pitman efficiency fnr m = 3, 0.764 for m = 4 , 


o 0 716 for m 
.637 for m = * = ^ 0>89 5 _for m = 15 and 0.955 

= 5- °- 868 £ ° r m . rovpment of the Friedman test 


Hence the need of WP ^ smaU and is less 

through alignment is greatest 


is large, 


Mehra and Sarangi (1967) have show* 
of the test based on Q relative to 
as m increases and that 


important when m 
that the Pitman efficiency 

the Friedman test based on Q m = 3 , 1.263 for m - 4, 

it is eqUal !°, 1 * 5 /° r for = m ^ m this context it is worth 

1.210 for m - an _ Q ± equivalent to the Wilcoxcn 

noting that the test based on Q is equi 

nmhlpm described earlier in this 
test for the two-sample problem aes^i. 

chapter. 

So far we have considered in this section complete two- 
factor layout, that is, the size of the blocks is equal to the 
number of treatments being compared. However it is frequently 
necessary to have, blocks size less than m such as balanced 
incomplete block designs or partially incomplete block designs, 
etc. Conversely it may also happen that the size of the blocks 
is greater than m. The Friedman test for the absence of treat¬ 
ment effects is generalized to cases where the block size is 
different than m and in particular by Durbin (1951) , Bernard 
and van Eltern ( 1953), and Sen (1968a, 1968b, 1977) to the 
balanced incomplete block designs. Mehra and Sarangi (1967), 
and Sarangi and Mehra (1969) have generalized the test based on 
Q to the case where the number of observation n. . on the jth 
treatment in the ith block is not necessarily equal to 1 
(nonorthogonal data). 

Consider a balanced incomplete block design with m treat¬ 
ments arranged in b blocks each containing -k treatments <* < 

th SaCh treatment ls re P lic ated r (jc b) times in such a way' 
pair of rSatment occurs more than once in any block and each 

in X bl oc kS . x* - 



in the ith block we denote by R.. the 

1 U 
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’rank oi its corresponding response among the k responses in the 

ith block. Otherwise we let R = o. Let R = v r , - 
. ~ . 1J j " ij r J ~ 

For testing the null hypothesis H q of no treatment 

effec ts the test analogous to the Friedman test rejects H 

o 

whenever 


12 ^ 

° = ml(k+l) ^ C R j- >ir(k+l )] 2 > c 


where the constant C depends on the level of significance a of 

~ _ o 

the test and under H Q is distributed as x , when both n,m 

o m-1 

are large. 

EXERCISE 8 

1. Show that if the distribution of X , ...,X is symmetric 

^ ^ A 

about 0 then the distribution of 3,6 and e of section 8.J. are 
also symmetric about 0. 

2. Show that in the two-sample shift model 

( a ) 0 _e is independent of 9 when F is continuous. 

(b) 6 is symmetrically distributed about 0 if either the 
distribution of F is symmetric about 0 or if m = n. 

3. Show that under H q : 0= 0 E(W X y )= ^mn, Var(W x y ) 

= nm(n+m+l) . 

-L/- 

4. aama : Davies, O.L. (1954), Design and analysis of 
industrial experiments, Oliver and Boyd, London. 

The following data were collected from a wear-testing 
experiment with 4 materials A test prece o eac 

material was taken from each of the 3 positions of 

mho Toss in weight due to wear was measured 
testing machine. The loss in weig 

i- material ib rni 11 i^f3.ms• 

on each piece or materia 
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Table 

O.X. 

M 1 

M 2 

M 3 

V 

P 1 

241 

195 

235 

234 

P 2 

■. 270 

241 

273 

236 

P 3 

274 

218 

230 

227 


* 


due to materials. ■■ . ■ *>>>■; 

5 The following observations were made from an experiment 
conducted in West Bengal, India with 4 different varieties of 
Wheat on the average number of grain per 10 ears of 14 randomly 
selected plants for each variety. 


Table 8.2. 

Average 

Number of Grains 

per 10 ears 

Plant 

Variety 

1 Variety 2 

Variety 3 

Variety 4 

number 





1 

261 

351 

322 

323 

2 

320 

424 

5 r, . 

380 

310 

t 

3 

287 

370 

260 

310 

4 

308 

328 

310 

395 

5 

311 

380 

278 

376 

6 

281 

374 

306 

401 

7 

254 

357 

307 

, s ’’ 

412 

8 

346 

282 

346 

• 389 

9 

290 

/ 404 

337 

368 

10 

287 

412 

285 

356 

11 

320 

390 

272 

328 

12 

307 

400 

276 

360 

13 

328 

406 

280 

405 
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analyse the data to test the null hypothesis of no variety 

effect. 

6. The following data on the average length of ear (in cm) 
for each of 10 randomly selected plants were recorded from an 
experiment with 3 varieties of barley crop. 

Table 8.3. Average length of ears (in cm) 


Plants number 

Variety 1 

Variety 2 

Variety 3 

1 

8.97 

10.19 

9.65 

2 

9.63 

9.47 

9.76 

3 

9.59 

10.75 

10.52 

4 

9.58 

10.48 

9.78 

5 

9.02 

10.21 

9.30 

6 

14.40 

10.79 

9.80 

7 

10.58 

11.06 

10.30 

8 

9.32 

11.14 

9.32 

9 

9.50 

11.43 

9.68 

10 

10.40 

11.12 

9.70 


Analyse the data to test the null hypothesis of no variety 
effect. 


1. source : Damaser, Shore and Orne (1963). Psychological 
■ffects during hyphonically required emotions. Psychosomatic 
ledicine, 25, pp. 334-343 (also reported in Lehmann (1975)). 

In a study of hypnosis the emotions of fear, happiness, 
epression and calmness were requested in a random order from 
ach of 8 subjects during hypnosis. Table 8.4 gives the 
esulting measurements of skin potential adjusted to initial 

evel in millivolts. 
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. . ski n potential injnillivolte 

Table 8.4. SKin v - - -- 


subjects 

4 


Emotions -“ 3 4 5 _-_ 

---10-5 23.6 11.9 54.6 21.0 

"" 22 \ 51.2 9.7 19.6 13-8 47.1 11.6 

Happiness . • w _ 8 21-1 13.7 39.2 1,.7 

Depression ■ ' a „. 6 13.3 37.0 , 14.8 

Calmness _ 22 • \ __— ” 

Analyse the data to test the hypothesis of no emotion effect. 

S source: C.w. Loel (1952) Industrial statistics, Business 
Books Ltd.. WO Fleet Street London EC4.G.B. 

Chemical reaction take place in solutions and when one of the 
reactants is also a good solvent for the other reactants it is 
customary to run the reaction in presence, of a large excess of 
solvent reactants. At the end of the reaction the excess of the 
solvent reactant has to be recovered for reuse and often it is 
found to be contaminated with some impurity formed in the reac- 
tion. If this impurity content increases on repeated recovery,, 
it may inhibit the reaction and it is necessary to clean the 
solvent by some means. One way in which an estimate may be made 
of whether or not the solvent needs cleansing is to check the 
reaction batch time cycles. Table 8.5 exhibits the data collected . 
from an experiment in which five reactors R 2> R 3 , R^ # R 5 and runs 

Table 8.5. Order of starting batches 


Block 


1. 

(r 5 ) 

18.4 

<r 3 > 

18.0 

( r 2 ) 

18.4 

( R x ) 

18.2 

(V 

18.0 

2. 

<v 

18.4 

(R 1 > 

18.6 

(r 5 > 

19.0 

(r 2 > 

19.0 

( V 

18.6 

3. 

(R 2 } 

19.5 

(R.) 

4 

18.9 

(R 3> 

19.6 

<v 

20.0 

‘V 

19.3 


were blocked so that Block 1 deals with runs made With fresh 
vent, Block 2 deals with runs made with some fresh and some 
recovered solvent and Block 3 deals with runs with some fresh and 
some solvent which has been through the process twice. Batches 
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Ex* assigned to the various reactors in random order. Analyse 
Ipfche dat & to test the hypotheses of no block effects. 
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